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Abstract 

The Lovasz Local Lemma is a seminal result in probabilistic combinatorics. It gives a sufficient 
condition on a probability space and a collection of events for the existence of an outcome that simul¬ 
taneously avoids all of those events. Finding such an outcome by an efficient algorithm has been an 
active research topic for decades. Breakthrough work of Moser and Tardos (2009) presented an efficient 
algorithm for a general setting primarily characterized by a product structure on the probability space. 

In this work we present an efficient algorithm for a much more general setting. Our main assumption 
is that there exist certain functions, called resampling oracles, that can be invoked to address the unde¬ 
sired occurrence of the events. We show that, in all scenarios to which the original Lovasz Local Lemma 
applies, there exist resampling oracles, although they are not necessarily efficient. Nevertheless, for es¬ 
sentially all known applications of the Lovasz Local Lemma and its generalizations, we have designed 
efficient resampling oracles. As applications of these techniques, we present new results for packings of 
Latin transversals, rainbow matchings and rainbow spanning trees. 
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1 Introduction 


The Lovasz Local Lemma (LLL) is a powerful tool with numerous uses in combinatorics and theoretical 
computer science. If a given probability space and collection of events satisfy a certain condition, then the 
LLL asserts the existence of an outcome that simultaneously avoids those events. The classical formulation 
of the LLL ifTSl iTTl is as follows. 

Let be a probability space with probability measure /r. Let i?i,..., be certain “undesired” events 
in that space. Let G be an undirected graph with vertex set [n] = n}. The edges of G are denoted 

E{G). Let r(i) = { J 7 ^ i : {hj} G E{G) } be the neighbors of vertex i. Also, let = r(i) U {i} 

and let r+(/) = Uie/ ^''"(*) I ^VA- 

Theorem 1.1 (General Lovasz Local Lemma lITSllT/l l. Suppose that the events satisfy the following condi¬ 
tion that controls their dependences 

PT[Ei \ rij^jEj] = PT[Ei] Vi G [n], J c [n] \r+(i) (Dep) 

/i 

and the following criterion that controls their probabilities 

3xi,..., Xn € (0,1) such that Px\Ei] < x* (1 — Xj) Vi € [n]. (GLL) 

^ ier(i) 


Then Pr^[f|"=i Ei] > 0. 

An equivalent statement of ( |Dep i is that the event Ei must be independent of the joint distribution 
on the events { Ej : j ^ r+(i) }. When ( Dep| ) holds, G is called a dependency graph. The literature 
contains several dependency conditions generalizing (Dep I and criteria generalizing (IGLLI) under which the 
conclusion of the theorem remains true. We will discuss several such generalizations below. 

The LLL can also be formulated fh! in terms of a directed dependency graph instead of an undirected 
graph, but nearly all applications of which we are aware involve an undirected graph. Accordingly, our 
work focuses primarily on the undirected case, but we will mention below which of our results extend to the 
directed case. 


Algorithms. Algorithms to efficiently find an oufcome in nr=i been fhe subjecf of research for 

several decades. In 2008, a nearly optimal resulf was obfained by Moser ll29l for a canonical applicafion 
of fhe LLL, fhe bounded-degree A:-SAT problem. Shorfly fhereafler, Moser and Tardos extended fhaf 
resulf fo a general scenario called fhe “variable model” in which D consisfs of independenf variables, each 
Ei depends on a subsef of fhe variables, and evenfs Ei and Ej are adjacenf in G if fhere is a variable on 
which fhey bofh depend. Clearly fhe resulfing graph is a dependency graph. The Moser-Tardos algorifhm is 
exfremely simple: after drawing an initial sample of fhe variables, if repeatedly checks if any undesired even! 
occurs, fhen resamples any such evenf. Resampling an even! means fhaf fhe variables on which if depends 
receive fresh samples according fo p,. Moser and Tardos prove fhaf, if fhe (IGLLI) condifion is safisfied, fhis 
algorifhm will produced fhe desired oufcome affer af mosf resampling operations, in expecfation. 

Numerous exfensions of fhe Moser-Tardos algorifhm have been proposed. These exfensions can handle 
more general criteria ll24l [Tl l25l. derandomizafion |[T3l . exponentially many evenfs 1201 . disfribufed 
scenarios ni, efc. However, fhese resulfs are restricted to the Moser-Tardos variable model and hence 
cannot be viewed as algorithmic proofs of the LLL in full generality. There are many known scenarios for 
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the LLL and its generalizations that fall outside the scope of the variable model ||2^IT71 . Section |3]discusses 
several such scenarios, including random permutations, matchings and spanning trees. 

Recently two efficient algorithms have been developed that go beyond the variable model. Harris and 
Srinivasan extend the Moser-Tardos algorithm to a scenario involving random permutations that origi¬ 
nates in work of Erdos and Spencer ifT^ . Achlioptas and Iliopoulos |5| developed a novel algorithmic “flaw 
correction” framework which allows one to model various applications of the LLL in a flexible manner. 
They show how this captures several applications of the LLL outside the variable model, and even some 
results that might be beyond typical formulations of the LLL. In contrast to the other results mentioned 
here, their framework does not involve an underlying measure /r and is not directly tied to the probabilistic 
setting of the LLL. This has some benefits, but also some restrictions that seem to prevent it from recover¬ 
ing the LLL in full generality. In particular, their publication [21 does not claim a formal connection with 
Theorem ll.il Section [131 contains further discussion of the related work. 

1.1 Our contributions 

The primary motivating question for this work is whether there is an “algorithmic proof’ of the Lovasz 
Local Lemma in general probability spaces. We answer this question in the following sense: We propose an 
algorithmic framework for the general Lovasz Local Lemma, based on a new notion of resampling oracles. 
In this framework, we present an algorithm that finds a point in nr=i (avoiding all undesired events) ef¬ 
ficiently, if given access to three types of subroutines outlined below (the most crucial one being resampling 
oracles). Whether these subroutines can be implemented efficiently is an instance-dependent issue, and we 
discuss this further below. However, we show that the existence of such subroutines is guaranteed by the 
assumptions of the Lovasz Local Lemma. In particular, our algorithm provides a new proof of Theorem ll.il 
(with no further assumptions), and several generalizations thereof, as described below. Algorithmically, we 
reduce the problem of finding a point in fllLi problem of implementing the three subroutines that 

we discuss next. 


1.1.1 Algorithmic assumptions 


In order to discuss algorithms for the LLL in full generality, one must assume some form of access to the 
probability space at hand. It is natural to assume that one can efficiently sample from p, and efficiently check 
whether a given event Ei occurs. However, even under these assumptions, finding the desired output can be 
computationally hard. (We show an example demonstrating this in Section [2^ 1 Therefore, our framework 
assumes the existence of one more subroutine that can be used by our algorithm. This leads us to the notion 
of resampling oracles. 

Let us introduce some notation. An atomic event io in the probability space If will be called a state. We 
write tu ~ /r to denote that a random state u is distributed according to p, and w ~ to denote that the 
distribution is p conditioned on E^. The resampling oracles are defined with respect to a graph G on [n] 


with neighborhood structure T (not necessarily satisfying the (Dep) condition). 
The three subroutines required by our algorithm are as follows. 


• Sampling from p: There is a subroutine that provides an independent random state u) ^ p. 

• Checking events: Lor each i G [n], there is a subroutine that determines whether uj ^ Ei. 

• Resampling oracles: Lor each i £ [n], there is a randomized subroutine rj : H with the following 
properties. 
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(Rl) If Ei is an event and oj ~ then ri{uj) ~ /x. (The oracle r* removes conditioning on Ei.) 

(R2) For any j ^ r+(i), if cj 0 Ej then also rj(a;) ^ Ej. (Resampling an event cannot cause new 
non-neighbor events to occur.) 

When these conditions hold, we say that r-i is a resampling oracle for events Ei, ... ,En and graph G. 


If efficiency concerns are ignored, the first two subroutines trivially exist. We show that (possibly 
inefficient) resampling oracles exist if and only if a certain relaxation of ( |Dep[ ) holds (see Section [T3]) . 

Main Result. Our main result is that we can find a point in fllLi efficiently, whenever the three subrou¬ 
tines above have efficient implementations. 


Theorem (Informal). Consider any probability space, any events Si,..., and any undirected graph G 
on vertex set [n]. If (IGLLI) is satisfied and if the three subroutines described above are available, then our 
algorithm finds a state in nr=i efficiently in terms of the number of calls to these subroutines. 


We make a more precise statement in the following section. We note that this theorem does not assume 
that (DepI holds, and the existence of resampling oracles is actually a strictly weaker condition. Thus, our 
algorithm provides a new proof of Theorem 1 1.1 1 (the existential LLL) under its original assumptions. 


1.2 Our algorithm: MaximalSetResample 

A striking aspect of the work of Moser and Tardos |[30ll is the simplicity and flexibility of their algorithm 
— in each iteration, any event Ei that occurs can be resampled. We propose a different algorithm that is 
somewhat less flexible, but whose analysis seems to be simpler in our scenario. Roughly speaking, our 
algorithm proceeds in iterations where in each iteration we resample events that form an independent set 
in G. The independent set is generated by a greedy algorithm that adds a vertex i and resamples Ei, if i 
is not adjacent to the previously selected vertices and Ei occurs in the current state. This is repeated until 
no events occur. Pseudocode for this procedure is shown in Algorithm [U Nearly identical algorithms have 
been proposed before, particularly parallel algorithms Il30ll24l . although our interest lies not in the parallel 
aspects but rather in making the LLL (and its stronger variants) algorithmic in our general setting. 


Algorithm 1 MaximalSetResample uses resampling oracles to output a state w G nr=i requires the 
three subroutines described in Section 11.1.11 sampling a; ~ checking if an event Ei occurs, and the 
resampling oracles r*. 

1: Initialize u with a random state sampled from 
2: t := 0; 

3 : repeat 

4: t-.= t + l-, 

5 : Jt := 0 

6: while there is f ^ r+( J^) such that oj ^ Ei do 

7 : Let i be the minimum index satisfying that condition; 

8: Jt := Jt U {f}; 

9: OJ := ri{oj)', > Resample Ei 

10 : end while 

11 : until Jt = 0 ; 

12 : return oj. 
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Our algorithmic proof of the LLL amounts to showing that MaximalSetResample terminates, at which 
point w € nr=i clearly holds. Our bound on the running time of MaximalSetResample is shown by the 
following theorem, which is proven in Section [5] We note that our bound is at most quadratic in the quantity 
which was the bound proved by Moser and Tardos l!30ll . 

Theorem 1.2. Suppose that the events Ei,..., En satisfy (IGLLI) and that the three subroutines described 
above in Section li.i.il are available. Then the expected number of calls to the resampling oracles before 
MaximalSetResample terminates is 0( \-x- '^^=1 ilx ■ )• 


1.3 Generalizing the dependency condition 

Erdos and Spencer 1(161 showed that Theorem [TTT] still holds when (|Dep|) is generalized tc0 


Fi[Ei I rij^jEj] < PilEi] 


Vi € [n], J C [n] \ r'*'(i). 


(Lop) 


They playfully called this the “lopsidependency” condition, and called G a “lopsidependency graph”. This 
more general condition enables several interesting uses of the LLL in combinatorics and theoretical com¬ 
puter science, e.g., existence of Latin transversals lIThll and optimal thresholds for satisfiability ITSl . 

Recall that Theorem [L2] did not assume (DepI and instead assumed the existence of resampling oracles. 
It is natural to wonder how the latter assumption relates to lopsidependency. We show that the existence 
of resampling oracles is equivalent to a condition that we call lopsided association, and whose strength lies 
strictly between (Depi and (|Lop[). The lopsided association condition is 


Pr[Ei n F] > Pr[Fi] ■ Pr[F] 

M U U 


Vz € [n],VF G Ei 


(LopA) 


where Ei contains all events E whose indicator variable is a monotone non-decreasing function of the 
indicator variables of (Ej : j ^ r+(i)). We call a graph satisfying (LopAi a lopsided association graph 
for events Ei, , En. 

Theorem (Informal). Resampling oracles exist for events Ei,..., En and a graph G if and only if G is a 
lopsided association graph for events Ei,..., En. 

This equivalence follows essentially from LP duality: The existence of a resampling oracle can be 
formulated as a transportation problem for which the lopsided association condition is exactly the necessary 
and sufficient condition for a feasible transportation to exist. Section IZTl proves this result in detail. 

As remarked above, the dependency conditions are related by ( |Dep| ) ^ (LopAi ^ (Lopi. The first 
implication is obvious since (Depi implies that Ei is independent of F in ( |LopA| ). To see the second 
implication, simply take E = UjeJ V C [n] \ r'^(i) to obtain that Pr^[Fj | UjgjFj] > Pr^[Fj]. 

Although lopsided association is formally a stronger assumption than lopsidependency, every use of the LLL 
with lopsidependency that we have studied actually satisfies fhe sfronger lopsided association condition. We 
demonsfrafe Ibis in Section [3]by designing efficienl resampling oracles for fhose scenarios. Consequenfly, 
Theorem 11.21 makes fhe LLL efficienl in fhose scenarios. 

As remarked above, Secfion 12.21 describes a scenario in which (|Dep|) and (IGLLI) are salisfied for a 


dependency graph G bul finding a slale lo € (HILi compulalionally hard, assuming slandard complexify 


Iheorelic beliefs. In fhaf scenario resampling oracles musf necessarily exisf since (Dep i is salisfied, bul Ihey 


* More precisely, ) |Lop[ ) should be restricted to J for which Pr^ilO^g jFj] > 0. However that restriction is ultimately unneces¬ 
sary because, in the context of the LLL, the theorem of Erdos and Spencer implies that > 0. 
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cannot be efficiently implemented due to the computational hardness. Therefore the equivalence between 
(LopA) and resampling oracles comes with no efficiency guaranfees. Neverfheless in all lopsidependency 
scenarios fhaf we have encounfered in applicafions of fhe LLL, efficienf implemenfafions of fhe resampling 
oracles arise nafurally from existing work, or can be devised wifh modesf effort. In particular fhis is fhe case 
for random permutations, perfect matchings in complete graphs, and spanning trees in complete graphs, as 
discussed in Section [3l 


1.4 Generalizing the LLL criterion 

In the early papers on the LLL ifTSl [37l . the (IGLLI) criterion relating the dependency graph G and the 
probabilities Pr^[£'j] was shown to be a sufficient condition to ensure that Pr^[Pl”^^ Ei] > 0. Shearer 
discovered a more general criterion that ensures the same conclusion. In fact. Shearer’s criterion is the best 
possible: whenever his criterion is violated, there exist a corresponding measure u and events E-\,... ,En 
for which Pr^[nr=i = 0. 

Section [5] formally defines Shearer’s criferion and uses if in a fundamenfal way fo prove Theorem 11.21 
Moreover, we give an algorifhmic proof of fhe LLL under Shearer’s criferion insfead of fhe (IGLLI) criferion. 
This algorifhm is efficienf in fypical sifuafions, alfhough fhe efficiency depends on Shearer’s paramefers. 
The following simplified resulf is sfafed formally and proven in Section 1531 

Theorem (Informal), Suppose that a graph G and the probabilities Pr^[i?i],..., Pr^[L^„] satisfy Shearer’s 
criterion with e slack, and that the three subroutines described in Section are available. Then the 
expected number of calls to the resampling oracles by MaximalSetResample is 0{j log ^). 

We also prove a more refined bound valid for any probabilifies salisfying Shearer’s criferion. This bound 
is similar fo fhe bound obfained by Kolipaka and Szegedy ll24l : see Secfion [33] for defails. 

Unforfunafely Shearer’s criferion is unwieldy and has nof seen much use in applicafions of fhe LLL. 
Recenfly several researchers have proposed criferia of infermediafe sfrengfh befween (IGLLI) and Shearer’s 
criferion HI [251. The firsf of fhese, called fhe cluster expansion criferion, was originally devised by Bissacof 
ef al. 18], and is based on insighfs from sfafisfical physics. This criferion has given improved resulfs in 
several applicafions of fhe local lemma @|21l3ll- Previous algorifhmic work has also used fhe clusfer 
expansion criferion in fhe variable model |[Il[33l and for permufafions ll2Tll . 

We give a new, elemenfary proof fhaf fhe clusfer expansion criferion implies Shearer’s criferion. In con- 
frasf, fhe previous proof is analytic and requires several ideas from sfafisfical physics [8,1. As a consequence, 
we obfain fhe firsf purely combinaforial proof fhaf fhe exisfenfial LLL holds under fhe clusfer expansion 
criferion. Anofher consequence f Theorem 1 1.31) is an algorifhm for fhe LLL under fhe clusfer expansion cri¬ 
ferion, obfained using our algorifhmic results under Shearer’s criterion. This generalizes Theorem 1 1.21 by 
replacing (IGLLI) with the cluster expansion criterion, stated below as (IGLLI) . To state the result, we require 
additional notation: let Ind denote the family of independent sets in the graph G. 

Theorem 1.3. Suppose that the events Ei,..., E^ satisfy the following criterion 

3yi,... ,yn > 0 such that Pv[Ei] < — - - —=-. (CLL) 

^ z.^jcr+(*),jeind 

and that the three subroutines described in Section li.i.il are available. Then the expected number of calls 
to the resampling oracles before MaximalSetResample terminates is 0( hi '^J=i T Uj))- 
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1.5 Techniques and related work 

The breakthrough work of Moser and Tardos [30l stimulated a string of results on algorithms for the 
LLL. This section reviews the results that are most relevant to our work. Several interesting techniques play 
a role in the analyses of these previous algorithms. These can be roughly categorized as the entropy method 
11281 1^. witness trees or witness sequences BOl 121112^ and forward-looking combinatorial analysis II19II . 

Moser 1291 |28l developed the entropy method to analyze a very simple algorithm for the “symmetric” 
LLL |[T5]| . which incorporates the maximum degree of G and a uniform bound on Pr^[ii^j]. The entropy 
method roughly shows that, if the algorithm runs for a long time, a transcript of the algorithm’s actions 
provides a compressed representation of the algorithm’s random bits, which is unlikely due to entropy 
considerations. 

Following this, Moser and Tardos OOl showed that a similar algorithm will produce a state in HILi 
assuming the independent variable model and the (IGLLI) criterion. This paper is primarily responsible for 
the development of witness trees, and proved the “witness tree lemma”, which yields an extremely elegant 
analysis in the variable model. The witness tree lemma has further implications. For example, it allows one 
to analyze separately for each event its expected number of resamplings. Moser and Tardos also extended 
the variable model to incorporate a limited form of lopsidependency, and showed that their analysis still 
holds in that setting. 

The main advantage of our result over the Moser-Tardos result is that we address the occurrence of 
an event through the abstract notion of resampling oracles rather than directly resampling the variables of 
the variable model. Furthermore we give efficient implementations of resampling oracles for essentially all 
known probability spaces to which the LLL has been applied. A significant difference with our work is 
that we do not have an analogue of the witness tree lemma; our approach provides a simpler analysis when 
the LLL criterion has slack but requires a more complicated analysis to remove the slack assumption. As 
a consequence, our bound on the number of resampling oracle calls is larger than the Moser-Tardos bound. 
Our lack of a witness tree lemma is inherent. Appendix shows that the witness tree lemma is false in the 
abstract scenario of resampling oracles. 

The Moser-Tardos algorithm is known to terminate under criteria more general than (IGLLI) . while still 
assuming the variable model. Pegden |[^ showed that the cluster expansion criterion suffices, whereas 
Kolipaka and Szegedy If2^ showed more generally that Shearer’s criterion suffices. We also extend our 
analysis to the cluster expansion criterion as well as Shearer’s criterion, in the more general context of 
resampling oracles. Our bounds on the number of resampling operations are somewhat weaker than those 
of 13311241, but the increase is at most quadratic. 

Kolipaka and Szegedy 124^ present another algorithm, called GeneralizedResample, whose analysis 
proves the LLL under Shearer’s condition for arbitrary probability spaces. GeneralizedResample is similar 
to MaximalSetResample in that they both work with abstract distributions and that they repeatedly choose 
a maximal independent set J of undesired events to resample. However, the way that the bad events are 
resampled is different: GeneralizedResample needs to sample from p\^ ^ -g-, which is a complicated 

operation that seems difficult to implement efficiently. Thus MaximalSetResample can be viewed as a 
variant of GeneralizedResample that can be made efficient in all known scenarios. 

Harris and Srinivasan ETl show that the Moser-Tardos algorithm can be adapted to handle certain events 
in a probability space involving random permutations. Their method for resampling an event is based on 
the Fischer-Yates shuffle. This scenario can also be handled by our framework; their resampling method 
perfectly satisfies the criteria of a resampling oracle. The Harris-Srinivasan’s result is stronger than ours 
in that they do prove an analog of the witness tree lemma. Consequently their algorithm requires fewer 
resamplings than ours, and they are able to derive parallel variants of their algorithm. The work of Harris 
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and Srinivasan is technically challenging, and generalizing it to a more abstract setting seems daunting. 

Achlioptas and Iliopoulos IH |3l proposed a general framework for finding “flawless objects”, based on 
actions for addressing flaws. We call this the A-I framework. They show that, under certain conditions, 
a random walk over such actions rapidly converges to a flawless object. This naturally relates to the LLL 
by viewing each event Ei as a flaw. At the same time, the A-I framework is not tied to the probabilistic 
formulation of the LLL, and can derive results, such as the greedy algorithm for vertex coloring, that seem 
to be outside the scope of typical LLL formulations, such as Theorem 11.11 The A-I framework 12 |3l has 
other restrictions and does not claim to recover any particular form of the LLL. Nevertheless, the framework 
can accommodate applications of the LLL where lopsidependency plays a role, such as rainbow matchings 
and rainbow Hamilton cycles. In contrast, our framework embraces the probabilistic formulation and can 
recover the original existential LLL (Theorem 1 1.11) in full generality, even incorporating Shearer’s general¬ 
ization. The A-I analysis 121 is inspired by Moser’s entropy method. Technically, it entails an encoding of 
random walks by “witness forests” and combinatorial counting thereof to estimate the length of the random 
walk. The terminology of witness forests is reminiscent of the witness trees of Moser and Tardos, but con¬ 
ceptually they are different in that the witness forests grow “forward in time” rather than backward. This is 
conceptually similar to “forward-looking combinatorial analysis”, which we discuss next. 

Giotis et al. IT^ show that a variant of Moser’s algorithm gives an algorithmic proof in the variable 
model of the symmetric LLL. While this result is relatively limited when compared to the results above, 
their analysis is a clear example of forward-looking combinatorial analysis. Whereas Moser and Tardos 
use a backward-looking argument to find witness trees in the algorithm’s “log”, Giotis et al. analyze a 
forward-looking structure: the tree of resampled events and their dependencies, looking forward in time. 
This viewpoint seems more natural and suitable for extensions. 

Our approach can be roughly described as forward-looking analysis with a careful modification of the 
Moser-Tardos algorithm, formulated in the framework of resampling oracles. Our main conceptual con¬ 
tribution is the simple definition of the resampling oracles, which allows the resamplings to be readily in¬ 
corporated into the forward-looking analysis. Our modification of the Moser-Tardos algorithm is designed 
to combine this analysis with the technology of “stable set sequences” l2^ . defined in Section [5Tl which 
allows us to accommodate various LLL criteria, including Shearer’s criterion. This plays a fundamental role 
in the full proof of Theorem 1 1.2 1 

Our second contribution is a technical idea concerning slack in the LLL criteria. This idea is a perfectly 
valid statement regarding the existential LLL as well, although we will exploit it algorithmically. One 
drawback of the forward-looking analysis is that it naturally leads to an exponential bound on the number 
of resamplings, unless there is some slack in the LLL criterion; this same issue arises in El [El. Our idea 
eliminates the need for slack in the (IGLLI) and (ICLLI) criteria. We prove that, even if (IGLLI) or (ICLLI) are 
tight, we can instead perform our analysis using Shearer’s criterion, which is never tight because it defines 
an open set. For example, consider the familiar case of Theorem 11.11 and suppose that (IGLLI) holds with 
equality, i.e., = Xi njer(i)(l “ show that the conclusion of the LLL remains true 

even if each event Ei actually had the larger probability Pr^lE'j] • (l -f (2 The proof of this 

fact crucially uses Shearer’s criterion and it does not seem to follow from more elementary tools lITSllTTl . 

Follow-up work. Subsequently, Achlioptas and Iliopoulos generalized their framework further to incor¬ 
porate our notion of resampling oracles [4]. This subsequent work can be viewed as a unification of their 
framework and ours; it has the benefit of both capturing the framework of resampling oracles and allowing 
some additional flexibility (in particular, the possibility of regenerating the measure // approximately rather 
than exactly). We remark that this work is still incomparable with ours, primarily due to the facts that our 
analysis is performed in Shearer’s more general setting, and that our algorithm is efficient even when the 













LLL criteria are tight. 

Organization. The rest of the paper is organized as follows. In SectionlH we discuss the connection between 
resampling oracles and the assumptions of the Lovasz Local Lemma. We also show here that resampling 
oracles as well as the LLL itself can be computationally hard in general. In Section |3l we show concrete 
examples of efficient implementations of resampling oracles. In Section |4] we discuss several applications 
of these resampling oracles. Finally, in Section[5]we present the full analysis of our algorithm. 


2 Resampling oracles: existence and efficiency 


The algorithms in this paper make no reference to the lopsidependency condition ( Lop| ) and instead assume 
the existence of resampling oracles. In Section [2T] we show that there is a close relationship between these 


two assumptions: the existence of a resampling oracle for each event is equivalent to the condition (LopAi, 


which is a strengthening of (Lop I. 

We should emphasize that the efficiency of an implementation of a resampling oracle is a separate issue. 
There is no general guarantee that resampling oracles can be implemented efficiently. Indeed, as we show 
in Section 12.21 there are applications of the LLL such that the resampling oracles are hard to implement 
efficiently, and finding a stafe avoiding all events is computationally hard, under standard computational 
complexity assumptions. 

Nevertheless, this is not an issue in common applications of the LLL: resampling oracles exist and can be 
implemented efficiently in all uses of the LLL of which we are aware, even those involving lopsidependency. 
Section[3]has a detailed discussion of several scenarios. 


2.1 Existence of resampling oracles 

This section proves an equivalence lemma connecting resampling oracles with the notion of lopsided asso¬ 
ciation. First, let us define formally what we call a resampling oracle. 

Definition 2.1, Let i?i,..., En be events on a space LI with a probability measure p, and let G = {[n],E) 
be a graph with neighbors ofiG [n] denoted by r(i). Let ri be a randomized procedure that takes a state 
u (z LI and outputs a state ri{ut) € O. We say that r* is a resampling oracle for Ei with respect to G, if 

(Rl) For oj ~ pIep we obtain rj(w) ~ p. (The oracle r* removes conditioning on Ei.) 

(R2) For any j ^ = r(f) U {f}, ifu ^ Ej then also ri{oj) 0 Ej. (Resampling an event cannot cause 

new non-neighbor events to occur.) 

Next, let us define fhe notion of a lopsided association graph. We denote by Efut] the {0, l}-valued 
function indicating whether Ei occurs at a state cj € fl. 

Definition 2.2. A graph G with neighborhood function T is a lopsided association graph for events Ei,..., En 

if 

Fr[Ei n F] > Pr[Fj] ■ Pr[F] Mi € [n],MF € Ei (LopA) 

fj. fj. 

where Ei contains all events E such that E[uj] is a monotone non-decreasing function of the functions 
(Ejico] : 

Lemma 2.3. Consider a fixed i € [n] and assume Pr^[Fj] > 0. The following statements are equivalent. 
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(a) There exists a resampling oracle rj satisfying the conditions (Rl) and (R2) with respect to a neighbor¬ 

hood (ignoring issues of computational efficiency). 

(b) Pr^[-Ei n F] > Pr^[Fj] ■ Vx^[F]for any event F € Fj. 

Corollary 2.4. Resampling oracles ri,..., exist for events Fi,..., with respect to a graph G if and 
only if G is a lopsided association graph for Fi,..., E^. Both statements imply that the lopsidependency 
condition ( |Lop| ) holds. 

Proof (of Lemma l2.3l) . (a) (b): Consider the coupled states {io,u;') where u ~ /r|^. and u' = ri{(jj). By 

(Rl), Lo' ~ p,. For any event F G F, if F does not occur at u) then it does not occur at w' either, due to (R2). 
This establishes that 


Pr[F] = = Pr[F 

fl fl 


Ei 




which implies Pr^[F n Ei] > Pr^[F] • Pr^[Fj]. In particular this implies ( Lop| ), by taking F = |J^ 

(b) => (a): We begin by formulating the existence of a resampling oracle as the following transportation 
problem. Consider a bipartite graph (U U W, E), where U and W are disjoint, U represents all the states 
a; G satisfying Fj, and W represents all the states w G (2. Edges represent the possible actions of 
the resampling oracle: {u,w) G F if n satisfies every event among (Ej : j ^ ) that w satisfies. 

Each vertex has an associated weight: Eor w G W, we dehne p^^ = Pr^[u;], and for u ^ U, pu = 
Pr^[rt]/Pr^[Fj], i.e, pu is the probability of u conditioned on Fj. We claim that the resampling oracle r* 
exists if and only if there is an assignment fuy^ of values to the edges such that 


Y1w:{u,w)gE fuw — Pu € U 

J2u:{u,w)eE fuw =Pw yweW (1) 

fuw >0 \/u G U, w G W. 


Such an assignment is called a feasible transportation. Given such a transportation, the resampling oracle is 
defined naturally by following each edge from u G U with probability fuw/Pu, and the resulting distribution 
on W is p^. Conversely, for a resampling oracle which, for a given state u G U, generates w G W with 
probability quw, we dehne fuw = PuQuw- This assignment satishes ([Til. 

Our goal at this point is show that (b) implies feasibility of ([T]l. A condition that is equivalent to ([U, 
but more convenient for our purposes, can be determined from EP duality ll^ Theorem 21.11]. A feasible 
transportation exists if and only if 

(lai) EueuPu = E w^wPw 

(1212) E.gAP« < E.er(A)P- VACC/, 

where r(A) = {w gW : 3u G A s.t. (u, w) G E }. This is an extension of Hall’s condition for the exis¬ 
tence of a perfect matching. 

Our goal at this point is show that (b) implies feasibility of (|2]l. Eet us now simplify (O. Eix any A C 
U. The neighborhood r(A) consists of states satisfying at most those events among { Ej : j ^ r+(z) } 
satished by some state in A. Thus r(A) corresponds to an event E' such that F'[a;] is a non-increasing 
function of ( Ej[(jj] : j ^ r+(i)). Next observe that, if the set of events among { Ej : j G r'''(f) } satished 
by u' G (7 is a subset of those satished by u G U, then r(rt') C r(u). Suppose that, for each u G A, we 
add to A all such vertices u'. Doing so can only increase the left-hand side of (|2]2), but does not increase 
the right-hand side as r(A) remains unchanged (since r(u') C r(n)). Eurthermore, the resulting set A 
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corresponds to the same event F', but restricted to the states in U. Let us call such a set A non-increasing. 
Let dlf ) denote the simplification of ^ in which we restrict to non-increasing A. We have argued that (O 
and (Ilf) are equivalent. 

Our goal at this point is show that (b) implies feasibility of (Ef ). One may easily see that (b) is equivalent 
to 

Fr[F n Ei] < Pr[F] ■ Fr[Ei] VF € j;. 

fj. 

Assuming Pr[Fj] > 0, we can rewrite this as Pr^[F | Ei] < Pr^[F] MF E F). Now consider using this 
inequality with F = F' for each F' corresponding to some non-increasing set A F U. We then have 
Pr^[F' I Ei] = Y^u&aPu and Pr^[F'] = Y1w&t{A)Pw This verifies fhe feasibilify of (Ef). □ 

2.1.1 Example: monotone events on lattices 

This section presents an example of a setting where Lemma 12.31 implies the existence of a non-trivial re¬ 
sampling oracle, even though the lopsided association graph is empty. This setting was previously known 
to have connections to the existential LLL 1261 . The probability space here is 12 = {0,1}^, viewed in the 
natural way as the Boolean lattice with operations A (meet) and V (join), and with the partial order denoted 
>. Let fi : {0,1}^ -A [0,1] be a probability distribution, i.e., X^^eto ~ assume that /U is 

log-supermodular, meaning that 

p{xV y)fi{x Ay) > y{x)p{y) Vx,yE{0,1}^. 

As an example, any product distribution is log-supermodular. Consider monotone increasing events Ei, i.e., 
such that x' > X ^ Ei ^ x' ^ Ei. Note that any monotone increasing function of such events is again 
monotone increasing. It follows directly from the FKG inequality Q that condition (b) of Lemma 12.31 is 
satisfied for such events with an empty lopsided association graph. Therefore, a resampling oracle exists in 
this setting. However, the explicit description of its operation might be complicated and we do not know 
whether it can be implemented efficiently in general. 

Alternatively, the existence of the resampling oracle can be proved directly, using a theorem of Hol¬ 
ley II 22 I Theorem 6]. The resampling oracle is described in Algorithm El The reader can verify that this 
satisfies the assumptions (Rl) and (R2), using Holley’s Theorem. 


Algorithm 2 Resampling oracle for a monotone increasing event E. Let v be the function guaranteed by 
Theorem 122] when p,i{x) = ’ ^ 2 ( 2 /) = /u(y), and Ix&E is the indicator function of x E F. 

1: Function r£;(x): 

2: If X 0 E, fail. 

3: Randomly select y with probability YF7P0W)' 

4: return y. 


Theorem 2.5 (Holley’s Theorem). Let pi and p 2 be probability measures on {0, 1}^ satisfying 
pi{xy y)y 2 {x Ay) > pi{x)p 2 {y) Vx, y E {0,1}^-^. 
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Then there exists a probability distribution v : {0,1}^ x {0,1}^^ —>^ M satisfying 

Ti{x) = T.y^{x,y) 

T 2 {y) = T,x^{x,y) 

r'{x,y) = 0 unless x>y. 


2.2 Computational hardness of the LLL 


This section considers whether the LLL can always be made algorithmic. We show that, even in fairly 
simple scenarios where the LLL applies, finding the desired output can be computationally hard, a fact that 
surprisingly seems to have been overlooked. We first observe that the question of algorithmic efficiency 
must be stated carefully otherwise hardness is trivial. 

A trivial example. Given a Boolean formula let the probability space be = {0,1}, and let p, be the 
uniform measure on fl. There is a single event Ei defined to be = {1} if 0 is satisfiable, and Ei = {0} if 
(f) is not satisfiable. Since Pr[i?i] = 1/2, the (IGLLI) criterion holds trivially with xi = 1/2. The LLL gives 
the obvious conclusion that there is a state lo ^ E. Yet, finding this state requires deciding satisfiability of 
f, which is NP-complete. 


The reason that this example is trivial is that even deciding whether the undesired event has occurred is 
computationally hard. A more meaningful discussion of LLL efficiency ought to rule out this trivial example 
by considering only scenarios that satisfy some reasonable assumptions. With that in mind, we will assume 
that 


• there is a probability space Q, whose states can be described by m bits; 

• a graph G satisfying (DepI for events Ei,, E^ is explicitly provided; 

• xi,..., Xn G (0,1) satisfying the (IGLLI) conditions are provided, and most poly(n); 

• there is a subroutine that provides an independent random state a; ~ /r in poly(m) time; 

• for each f G [n], there is a subroutine which determines for any given a; G whether ix G Ei, in 
poly(m) time. 

As far as we know, no prior work refutes the possibility that there is an algorithmic form of the LLL, with 
running time poly(m, n), in this general scenario. 

Our results imply that resampling oracles do exist in this general scenario, so it is only the question of 
whether these resampling oracles are efficient that prevents Theorem 11.21 from providing an efficient algo¬ 
rithm. Nevertheless, we show that there is an instance of the LLL that satisfies the reasonable assumptions 
stated above, but for which finding a state in P|^ Ei requires solving a problem that is computationally 
hard (under standard computational complexity assumptions). As a consequence, we conclude that the re¬ 
sampling oracles cannot always be implemented efficiently, even under the reasonable assumptions of this 
general scenario. 

We remark that NP-completeness is not the right notion of hardness here |[32l . Problems in NP involve 
deciding whether a solution exists, whereas the LLL guarantees that a solution exists, and the goal is to 
explicitly find a solution. Our result is instead based on hardness of the discrete logarithm problem, a 
standard belief in computational complexity theory. In the following, GF(p’^) for a prime p and integer n 
denotes a finite field of order p"^, and GF*(p”) its multiplicative group of nonzero elements. 


Theorem 2.6. There are instances of events Ei,... ,En on a probability space 0 = {0,1}” under the 
uniform probability measure, such that 

• the events Ei are mutually independent; 
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• for each i € [n], the condition uj €z Ei can be checked in poly{n) time for given oj E O; 

• the (IGLLI) conditions are satisfied with Xi = 1/2 for each i E [n]; 

but finding a state in fllLi hard as solving the discrete logarithm problem in GF*(2"^). 

Remark. Superficially, this result seems to contradict the fact that the LLL can be made algorithmic in the 
variable model l|30l, where events are defined on underlying independent random variables. The key point 
is that the variable model also relies on a particular type of dependency graph (defined by shared variables) 
which might be more conservative than the true dependencies between the events. Theorem 12.61 shows that, 
even if the probability space consists of independent {0,1} random variables, the LLL cannot in general be 
made algorithmic if the true dependencies are considered. 

Proof Consider an instance of the discrete logarithm problem in the multiplicative group GF*(2”). The 
input is a generator (7 of GF*(2”) and an elementE GF*(2’^). The goal is to find an integer 1 <k< 2” —1 
such that = h. We define an instance of n events on = {0, !}"■ as follows. 

We identify Q = {0,1}” with [2*^] as well as GF(2’^) in a natural way. We define / : [2”] —>■ GF(2”) 
by /(O) = 0 and f{x) = for x 0, where the exponentiation is performed in GF(2”). For each i E [n], 
we define an event Ei that occurs for u E {0,1}”^ iff {f{uj))i = 1 — /i*. This is a condition that can be 
checked in time poly(n), by computing /(tu) = 5 “ where we interpret co as Yl/Zo ^* 2 * and compute g^ by 
taking squares iteratively. 

Observe that for oj distributed uniformly in 0 = {0,1}”, /(w) is again distributed uniformly in 0, 
since / is a bijection (0 is mapped to 0 , and f{uj) for w 0 generates each element of the multiplicative 
group GF*(2”) exactly once). Therefore, the probability of Ei is 1/2, for each i E [n]. Further, the events 
El,, En are mutually independent, since for any J C [n], Eji occurs iff f{oj) = /i©lj, 

which happens with probability 1/2”. Here Ij E {0,1}” is the indicator vector for the set J, and © denotes 
addition in GF(2”) (i.e., component-wise xor in {0,1}”). Hence the dependency graph is empty, and the 
LLL with parameters Xi = 1/2 trivially implies that there exists a state oj avoiding all the events. In this 
instance, we know explicitly that the state avoiding all the events is f~^{h). Therefore, if we had an efficient 
algorithm to find this point for any given h E GF*(2”), we would also have an efficient algorithm for the 
discrete logarithm problem in GF(2”). □ 

3 Implementation of resampling in specific settings 

In this section, we present efficient implementations of resampling oracles in four application settings: inde¬ 
pendent random variables (which was the setting of If30]| ). random permutations (handled by GU I, perfect 
matchings in complete graphs (some of whose applications are made algorithmic by |* 21 ), and spanning trees 
in complete graphs (which is a new scenario that we can handle). To be more precise, resampling oracles 
also depend on the types of events and dependencies that we want to handlejl In the setting of independent 
random variables, we can handle arbitrary events with dependencies defined by overlapping relevant vari¬ 
ables, just like OOl . In the setting of permutations, we handle the appearance of patterns in permutations as 
in GTIl . In the settings of matchings and spanning trees, we consider the “canonical events” defined by 1261, 
characterized by the appearance of a certain subset of edges. We also show in Section |33] how resampling 
oracles for a certain probability space can be extended in a natural way to products of such probability spaces 
(for example, how to go from resampling oracles for one random permutation to a collection of independent 
random permutations). These settings cover all the applications of the lopsided LLL that we are aware of. 

^In Section lT^ we give an example of events on independent random variables for which resampling oracles exist hut cannot be 
made efficient. 
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3.1 The variable model 


This is the most common setting, considered originally by Moser and Tardos 1301 . Here, Q has a product 
structure corresponding to independent random variables { Xa : a & U }. The probability measure /r here 
is a product measure. Each bad event Ei depends on a particular subset of variables Ai, and two events are 
independent iff Ai n Aj = 0. 

Here our algorithmic assumptions correspond exactly to the Moser-Tardos framework |[30l . Sampling 
from /i means generating a fresh set of random variables independently. The resampling oracle takes a 
state u! and replaces the random variables { Xa : a G Aj } by fresh random samples. It is easy to see that the 
assumptions are satisfied: in particular, a random state sampled from fj, conditioned on Ei has all variables 
outside of Aj independently random. Hence, resampling the variables of Aj produces the distribution //. 
Clearly, resampling { Xa : a G Aj } does not affect any events whose variables do not intersect Aj. 

We note that this resampling oracle is also consistent with the notion of lopsidependency on product 
spaces considered by ll30l : They call two events Ei , Ej lopsidependent, if Aj fi Aj 7 ^ 0 and it is possible 
to cause Ej to occur by resampling Aj in a state where Ei holds but Ej does not (the definition in |[30l is 
worded differently but equivalent to this). This is exactly the condition that we require our resampling oracle 
to satisfy. 

3.2 Permutations 

The probability space here is the space of all permutations vr on a set [n], with a uniform measure /r. 
The bad events are assumed to be “simple” in the following sense: Each bad event Ei is defined by a 
“paffem” P{Ei) = {(xi, yi),..., {xt(i),yt{i))}. The event Ei occurs if ^{xj) = yj for each 1 < y < t{i). 
Eet vbl(£'j) = { X : 3y, {x, y) G P{Ei) } denote the variables of tt relevant to event Ej. Eet us define a 
relation i ~ T to hold iff there are pairs {x, y) G P{Ei), (x', y') G P{Eii) such that x = x' or y = y'; i.e., 
the two events entail the same value in either the range or domain. This relation defines a lopsidependency 
graph. It is known that the lopsided EEE holds in this setting. 


Algorithm 3 Resampling oracle for permutations 
1: Function rj(7r): 

2: X := vbl(Sj), i.e., the variables in vr affecting event Ei, 

3: Eix an arbitrary order X = (xi, X 2 , • • •, x*); 

4: for i = t down to 1 do 

5: Swap 7r(xj) with 7r(z) for z uniformly random among [n] \ {xi,..., Xj_i}; 

6 : end for 
7: return vr; 


Harris and Srinivasan showed how, under the EEE criteria, a permutation avoiding all bad events can 
be found algorithmically. We implement the resampling oracle based on their algorithm (see Algorithm |3]l. 
To prove the correctness of this resampling oracle within our framework, we need the following lemma. 

Lemma 3.1. Suppose that a permutation tt has some arbitrary fixed assignment on the variables in X, 
T^\x = and it is uniformly random among all permutations satisfying 7r|x = Then the output of 
Shuffle{'K, X) is a uniformly random permutation. 

The procedure is known as the Eisher-Yates shuffle for generating uniformly random permutafions (and 
was used in ll^ as well). In contrast to the full shuffle, we assume that some part of the permutation has 
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been shuffled already: X is the remaining portion that still remains to be shuffled, and conditioned on its 
assignment the rest is uniformly random. This would be exactly the dishibution achieved after performing 
the Fisher-Yates shuffle on the complement of X. Our procedure performs the rest of the Fisher-Yates 
shuffle, which produces a uniformly random permutation. For completeness we give a self-contained proof. 

Proof. Let X = {xi,...,xt}. By induction, after performing the swap for Xj, the permutation is uniform 
among all permutations with a fixed assignment of {xi, ..., Xi_i} (consistent with (j)). This holds because, 
before the swap, the permutation was by induction uniform conditioned on the assignment of {xi,..., Xj} 
being consistent with and we choose a uniformly random swap for x, among the available choices. This 
makes every permutation consistent with f on {xi,..., Xj_i} equally likely after this swap. □ 

This verifies the first condition for our resampling oracle. The second condition is that resampling of 
occurring events does not affect non-neighbor events. This is true because of the following lemma. 

Lemma 3.2. The resampling oracle rj(7r) applied to a permutation satisfying Ei does not cause any new 
event outside ofT^{I) to occur. 

Proof. Suppose Ej changed its status during a call to ri(7r). This means that something changed among its 
relevant variables Yhl{Ej). This could happen in two ways: 

(1) either a variable z € vhl{Ej) was swapped because z ^ X = vbl(F^i); then clearly j G r'''(i). 

(2) or, a variable in vhl{Ej), although outside of X, received a new value by a swap with some variable 

in Y = vbl(£'i). Note that in the Shuffle procedure, every time a variable z outside of X changes its value, 
it is by a swap with a fresh variable of X, i.e. one that had not been processed before. Therefore, the value 
that z receives is one that previously caused Ei to occur. If it causes Ej to occur, it means that Ei and Ej 
share a value in the range space and we have j € as well. □ 

3.3 Perfect matchings 

Here, the probability space is the set of all perfect matchings in K 2 n, with the uniform measure. This 
is a setting considered by [21 and it is also related to the setting of permutations. (Permutations on [n] 
can be viewed as perfect matchings in Kn,n-) A state here is a perfect matching in K 2 n, which we denote 
by M € n. We consider bad events of the following form: Ea for a set of edges A occurs if Y C M. 
Obviously, Pr^[£'y 4 ] > 0 only if Y is a (partial) matching. Let us define A. ~ H iff ^ U H is not a matching. 
It was proved in |[26l that this defines a lopsidependency graph. 

Our goal is to implement a resampling oracle in this setting. We describe such an operation in Algo¬ 
rithmic 

Lemma 3.3. Let Abe a matching in K 2 n and let M be distributed uniformly among perfect matchings in 
K 2 n such that A C M. Then after calling the resampling oracle, rA{M) is a uniformly random perfect 
matching. 

Proof. We prove by induction that at any point, M' is a uniformly random perfect matching conditioned on 
containing A'. This is satisfied at the beginning: M' = M, A' = A and M is uniformly random conditioned 
on A C M. 

Assume this is true at some point, we pick (w, v) G A' arbiharily and (x, y) G M' \ A' uniformly at 
random. Denote the vertices covered by M'\A' by V{M'\A'). Observe that for a uniformly random perfect 
matching on V(M' \ A') U {u, x}, the edge (u, v) should appear with probability 1/(2|M' \ A'| -|- 1) since 
u has 2|M' \ A'l -|- 1 choices to be matched with and x is 1 of them. Consequently, we keep the edge {u, v) 
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Algorithm 4 Resampling oracle for perfect matchings 

1: Function rA{M): 

2: Check that A C M, otherwise return M. 

3: A' := A; 

4: M' := M; 

5: while A' 7 ^ 0 do 

6: Pick (n, v) € A' arbitrarily; 

7: Pick (x, y) € M' \ A' uniformly at random, with (x, y) randomly ordered; 

8 : With probability 1 - 2 \ m'\A' \ +i ^ 

9: Add (tt, y), {v, x) to M' and remove {u, v), (x, y) from M'; 

10 : Remove {u, v) from A'; 

11 : end while 
12 : return M'. 


with probability 1/(2|M' \ A'| + 1) and conditioned on this M' \ A' is uniformly random by the inductive 
hypothesis. Conditioned on (u, v) not being part of the matching, we re-match (rt, v) with another random 
edge (x, y) € M' \ A' where (x, y) is randomly ordered. In this case, u and v get matched to a uniformly 
random pair of vertices x,y ^ V {M' \ A'), as they should be. The rest of the matching M' \ A' \ {(x, y)} 
is uniformly random on V(M' \ A' \ {x, y}) by the inductive hypothesis. 

Therefore, after each step M'\A' is uniformly random conditioned on containing A'. At the end. A' = 0 
and M' is uniformly random. □ 

Lemma 3.4. The resampling oracle r^iM) applied to a perfect matching satisfying event Ea does not 
cause any new event Eb such that B ^ r'’'(A). 

Proof Observe that all the new edges that the resampling oracle adds to M are incident to some vertex 
matched by A. So if an event Eb was not satisfied before the operation and it is satisfied afferwards, it must 
be the case that B contains some edge not present in A but sharing a vertex with A. Hence, A U H is not a 
matching and A ~ H. □ 


3.4 Spanning trees 


Here, the probability space 0 is the set of all spanning trees in Let us consider events Ea for a set of 
edges A, where Ea occurs for T GO iff ACT. Define A ~ H for disfinct A, B unless A and B are 
vertex-disjoint. Lu et al. Lemma 7] show that this in fact defines a dependency graph for spanning trees. 


It is worth emphasizing that in this scenario the (Dep I condition holds (the more general condition (Lop i is 
not needed), but the scenario does not fall within the scope of the Moser-Tardos variable model. It does fall 
within the scope of our framework, but one must design a non-trivial resampling oracle. 

To implement a resampling oracle in this setting, we will use as a subroutine an algorithm to generate 
a uniformly random spanning tree in a given graph G. This can be done efficiently by several methods, for 
example by a random walk ifTOll . 


Lemma 3.5. If A is a fixed forest and T is a uniformly random spanning tree in conditioned on A CT, 
then rA{T) produces a uniformly random spanning tree in Kn- 
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Algorithm 5 Resampling oracle for spanning trees 
1: Function r A (T): 

2 : Check that A <ZT, otherwise fail. 

3: Let W = V f A), the vertices covered by A. 

4: Let Ti = 2 ) n T, the edges of T disjoint from W . 

5: Let Fi = ('^2'^) \ T, the edges disjoint from W not present in T. 

6 : Let G 2 = {Kn \ Fi)/Ti be a multigraph obtained by deleting Fi and contracting Ti. 
7: Generate a uniformly random spanning tree T 2 in G2. 

8: return Ti U r 2 . 


Proof. First, observe that since T2 is a spanning tree of G2 = {Kn \ Fi)/Ti, it is also a spanning tree of 
Kn/Ti where Ti is a forest, and therefore Ti U r 2 is a spanning tree of Kn- We need to prove that it is a 
uniformly random spanning tree. 

First, we appeal to a known result tIEl Lemma 6] stating that given a forest F in Kn with components 
of sizes (number of vertices) /i, / 2 , ■ ■ ■, /m^ the number of spanning trees containing F is exactly 




fi 


(3) 


Equivalently (since n" ^ is the total number of spanning trees), for a uniformly random spanning tree T, 
Pr[F CT] = UT= 1 This has the surprising consequence that for vertex-disjoint forests Fi, F 2 , 

we have Pr[Fi U F 2 C T] = Pr[Fi C T] • Pr[F 2 F T], i.e., the containment of Fi and F2 are independent 
events. (In a general graph, the appearances of different edges in a random spanning tree are negatively 
correlated, but here we are in a complete graph.) 

Let W = V{A) and let B be any forest on P \ W, i.e., vertex-disjoint from A. By the above, the 
appearance of F in a uniformly random spanning tree is independent of the appearance of A. Hence, if 
T is uniformly random, we have Pr[i? C T \ A C T] = Pr[i? C T]. This implies that the distribution 
of T n (^ 2 ^^) is exactly the same for a uniformly random spanning tree T as it is for one conditioned on 
AFT (formally, by applying the inclusion-exclusion formula). Therefore, the forest Ti = T n (^ 2 ^) is 
distributed as it should be in a random spanning tree restricted to P \ kF. 

The final step is that we extend Ti to a spanning tree Ti UT 2 , where T 2 is a uniform spanning tree in 
G 2 = {Kn \ Fi)/Ti. Note that G 2 is a multigraph, i.e., it is important that we preserve the multiplicity of 
edges after contraction. The spanning trees T 2 in G 2 = {Kn \ Fi)/Ti are in a one-to-one correspondence 
with spanning trees in Kn conditioned on Tfl (^ 2 ^) = Tj. This is because each such tree T 2 extends Ti to 
a different spanning tree of Kn, and each spanning tree where T D (^ 2 '^) = Ti can be obtained in this way. 
Therefore, for a fixed Ti, Ti U T 2 is a uniformly random spanning free conditioned on T D (^ 2 ^^) = Ti. 
Finally, since fhe disfribufion of Ti is equal fo thaf of a uniformly random spanning free resfriefed fo P \ W, 
Ti U r 2 is a uniformly random spanning free. □ 

Lemma 3.6. The resampling oracle rA{T) applied to a spanning tree satisfying Ea does not cause any new 
event Eb such that B ^ r'’'(A). 

Proof Nofe fhaf fhe only edges fhaf we modify are fhose incidenf fo IP = V{A). Therefore, any new 
evenf Eb thaf fhe operation of could cause musf be such fhaf B confains an edge incidenf fo W and not 
contained in A. Such an edge shares exactly one vertex with some edge in A and hence B ^ A. □ 
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3.5 Composition of resampling oracles for product spaces 

Suppose we have a product probability space x Q 2 x • • • x where on each we have resampling 

oracles rij for events Eij,j € £i, with respect to a graph Gi. Our goal is to show that there is a natural way 
to combine these resampling oracles in order to handle events on O that are obtained by taking intersections 
of the events Eij. The following theorem formalizes this notion. 

Theorem 3.7. Let Lli,... be probability spaces, where for each Oj we have resampling oracles rijfor 
events Eij,j G Ei with respect to a graph Gi. Let O = Oi x O 2 x ... Ojv be a product space with the 
respective product probability measure. For any set J of pairs G Ei where each i G [A^] appears 

at most once, define an event Ej on El to occur in a state uj = (wi,... ,a;jv) iff Eij occurs in ooifor each 
{i,j) G J. Define a graph G on these events by J ^ J' iff there exist pairs {i,j) G J, E J' such that 
j ~ j' in Gi. Then there exist resampling oracles rjfor the events Ej with respect to G, which are obtained 
by calling in succession each of the oracles Vijfor {i,j) G J. 

Proof. For notational simplicity, let us assume that on each Eli we have a trivial event Eo) = Eli and the 
respective resampling oracle r^o is the identity on Eli. Then we can assume that each collection of events J 
is in the form J = {(1, ji), ( 2 ,^ 2 )) • • • j {Ef,j]y)}, where we set j£ = 0 for components where there is no 
event to resample. We define 

rj{u)i,.. .,ujn) = {rij^{u:i),r2j2{u)2), ■ ■ ■ ,rNjN{(^N))- 

We claim that these are resampling oracles with respect to G as defined in the theorem. 

Let us denote by pi the probability distribution on Eli and by p the product distribution on fl. For 
the first condition, suppose that w ~ t\ej- By the product structure of ff, this is the same as having 
u) = {u!i,... ,ujn) where the components are independent and uj£ ~ belEtj^ for each G J, and 

(jj£ ~ p£ for components such that j£ = 0. By the properties of the resampling oracles r£j^, we have 
r£jf {(jJ£) ~ P£. Since the resampling oracles are applied with independent randomness for each component, 
we have 

rj{uj) = {rijj^{uJi),r 2 j 2 {uJ 2 ),-■ ■ ,rNjNi‘^N)) ~ pi x p 2 x ... x pN = P- 

For the second condition, note that if a; ^ Ejt and rjfjj) G Ej/, it must be the case that there is 
E J and G J' such that oj£ ^ and r£j^{oj) G However, this is possible only if j£ ~ 

in the graph Gi. By the definition of G, this means that J ~ J' as well. □ 

As a result, we can extend our resampling oracles to spaces like E-tuples of independent random per¬ 
mutations, independent random spanning trees, etc. Such extensions are used in our applications. 

4 Applications 

Let us present a few applications of our framework. Our application to rainbow spanning trees is new, even 
in the existential sense. Our applications to Latin transversals and rainbow matchings are also new to the 
best of our knowledge, although they could also have been obtained using the framework of ll^ and f2l|. 

4.1 Rainbow spanning trees 

Given an edge-coloring of E„, a spanning tree is called rainbow if each of its edges has a distinct color. The 
existence of a single rainbow spanning tree is completely resolved by the matroid intersection theorem: It 
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can be decided efficiently whether a rainbow spanning tree exists for a given edge coloring, and it can be 
found efficiently if it exists. However, the existence of multiple edge-disjoint rainbow spanning trees is more 
challenging. An attractive conjecture of Brualdi and Hollingsworth ifTTI states that if n is even and Kn is 
properly edge-colored by n — 1 colors, then the edges can be decomposed into re /2 rainbow spanning trees, 
each tree using each color exactly once. Until recently, it was only known that every such edge-coloring 
contains 2 edge-disjoint rainbow spanning trees |S]. In a recent development, it was proved that if every 
color is used at most re /2 times (which is true for any proper coloring) then there exist H(re/logre) edge- 
disjoint rainbow spanning trees ifT^ . In fact this result seems to be algorithmically efficient, although this 
was not claimed by the authors. We prove that using our framework, we can find Q{n) rainbow spanning 
trees under a slight strengthening of the coloring assumption. 

Theorem 4.1, Given an edge-coloring of Kn such that each color appears on at most edges, at 

least edge-disjoint rainbow spanning trees exist and can be found in 0{nf) resampling oracle calls 

with high probability. 

This result relies on Theorem 11.31 our algorithmic version of the LLL under the cluster expansion cri¬ 
terion. To obtain the result with high probability, we appeal to a more refined bound that we state in Theo¬ 
rem |5iMl We note that if there is constant multiplicative slack in the assumption on color appearances, the 
number of resamplings improves to O(re^), using the result in Theorem [534] with constant e slack. 

To prove the existential statement, we simply sample MWn independently random spanning trees and 
hope that they will be (a) pairwise edge-disjoint, and (b) rainbow. This unlikely proposition happens to be 
true with positive probability, thanks to the LLL and the independence properties of random spanning trees 
that we mentioned in Section 1331 Given this setup, our framework implies that we can also find the rainbow 
trees efficiently. 

Proof We apply our algorithm in the setting of t independent and uniformly random spanning trees Ti,... ,Tt C 
Kn, with the following two types of bad events: 

• Elj-. For each i G [t] and two edges e 7 ^ / in Kn of the same color, E^j, occurs if {e, /} C Tp, 

• E]j\ For each i j ^ [t] and an edge e in Kn, E^ occurs if e G Tj n Tj. 

Clearly, if no bad event occurs then the t trees are rainbow and pairwise edge-disjoint. 

By (O the probability of a bad event of the first type is Pr[£^*j] = 3/re^ if |e U /| = 3 and Pr[£^*j] = 

4/re^ if |e U /I = 4. The probability of a bad event of the second type is Pr[i?e'^] = (2/re)^ = 4/re^, since 
each of the two trees contains e independently with probability 2/re. Hence, the probability of each bad 
event is upper-bounded by p = 4/re^. 

In Section 13.41 we constructed a resampling oracle for a single spanning tree. By Theorem 13.71 
this resampling oracle extends in a natural way to the setting of t independent random spanning trees. In 
particular, for an event E'gj, we define r® ^ as an application of the resampling oracle to the tree Tj. For 

an event Ee , we define as an application of the resampling oracle rjgj independently to the trees Tj and 
Tj. It is easy to check using Theorem l3.7l that for independent uniformly random spanning trees conditioned 
on either type of event, the respective resampling oracle generates independent uniformly random spanning 
trees. 

Let us define the following dependency graph; we are somewhat conservative for the sake of simplicity. 

The graph contains the following kinds of edges: 

• Tgj ~ ^e'p whenever e U / intersects e' U /'; 


19 






• -Elf whenever e' intersects e U /; 

• El^ ~ E^J , £'*/ whenever e' intersects e. 

We claim that the resampling oracle for any bad event can cause new bad events only in its neighborhood. 
This follows from the fact that the resampling oracle affect only the trees relevant to the event (in the 
superscript), and the only edges modified are those incident to those relevant to the event (in the subscript). 

Let us now verify the cluster expansion criterion, introduced as (ICLLI) in Section 11.41 so that we may 
apply Theorem 15 .441 Let us assume that each color appears on at most q edges, and we generate t random 
spanning trees. We claim that the neighborhood of each bad event can be partitioned into 4 cliques of size 
{n — l){t — 1) and 4 cliques of size (n — l)(q — 1). 

First, let us consider an event of type £'*y. The neighborhood of consists of: (1) events El,j:, where 
e' or /' shares a vertex with e U /; these events form 4 cliques, one for each vertex of e U /, and the size of 
each clique is at most (n — 1 ) (q — 1 ), since the number of incident edges to a vertex is n — 1 , and the number 
of other edges of the same color is at most q — 1. (2) events E^J where e' intersects e U /; these events form 
4 cliques, one for each vertex of e U /, and each clique has size at most (n — l)(f — 1), since its events can 
be identified with the (n — 1 ) edges incident to a fixed vertex and the remaining t — 1 trees. 

Second, let us consider an event of type El^. The neighborhood of El^ consists of: (1) events and 
E^ijr/ where e intersects e' U /'; these events form 4 cliques, one for each vertex of e and either i or j in 
the superscript, and the size of each clique is at most (n — l)(q — 1 ) by an argument as above. (2) events 
E'‘J ,E^^, where e' intersects e; these events form 4 cliques, one for each vertex of e and either i'j or ij' in 
the superscript. The size of each clique is at most (n — l)(f — 1), since the events can be identified with the 
(n — 1 ) edges incident to a vertex and the remaining t — 1 trees. 

Considering the symmetry of the dependency graph, we set the variables for all events equal to y* ^ = 

He = y- The cluster expansion criteria will be satisfied if we set the parameters so that 

^_ y _ ^ _ y _ 

^ - (1 + (n - l)(f - l)y)^(l + {n-l){q- l)y)^ “ E/cr+(i<;),/Gind y^ ’ 

where E denotes either Elj or Ee ■ The second inequality holds due to the structure of the neighborhood of 
each event that we described above. We set y = /3p = 4/)/n^ and assume t < 'jn,q < jn. The reader can 
verify that with the settings (3 = (|)® and 7 = ^(|)^^ we get = 1- Therefore, 

^ I3p ^ _ y _ 

^ “ (1 + 47 / 3 )^ “ (1 + (n - l)(f - l)y)^(l + {n - l){q - l)y)‘^ 

which verifies the assumption of Theorem l5.44l Theorem l5.44l implies that MaximalSetResample terminates 
after 0((X] ylf + y^Y) resampling oracle calls with high probability. The total number of events here 
is 0{tqv?) = O(n^) and for each event the respective variable is y = 0{l/n?). Therefore, the expected 
number of resampling oracle calls is O(n^). □ 

4.2 Rainbow matchings 

Given an edge-coloring of K 2 n, a perfect matching is called rainbow if each of its edges has a distinct color. 
This can be viewed as a non-bipartite version of the problem of Latin transversals. It is known that given any 
proper ( 2 n — 1 )-edge-coloring of K 2 n (where each color forms a perfect matching), there exists a rainbow 
perfect matching |[38l. However, finding rainbow matchings algorithmically is more difficult. Achlioptas 
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and Iliopoulos [ 2 ] showed how to find a rainbow matching in K 2 n efficiently when each color appears on at 
most 7 n edges, 7 < ^ ~ 0.184. Our result is that we can do this for 7 = ^ ~ 0.211. The improvement 
comes from the application of the “cluster expansion” form of the local lemma, which is still efficient in our 
framework. (We note that an updated version of the Achlioptas-Iliopoulos framework O also contains this 
result.) 

Theorem 4.2. Given an edge-coloring of K 2 n where each color appears on at most -^n edges, a rainbow 
perfect matching exists and can be found in O(re^) resampling oracle calls with high probability. 

In fact, we can find many disjoint rainbow matchings — up to a linear number, if we replace ^ above 
by a smaller constant. 

7? 7? 

Theorem 4.3. Given an edge-coloring of K 2 n where each color appears on at most edges, at least 
edge-disjoint rainbow perfect matchings exist and can be found in 0{n^) resampling oracle calls whp. 

We postpone the proof to Section 1431 since it follows from our result for Latin transversals. 

Proof of Theorem \4~2\ We apply our algorithm in the setting of uniformly random perfect matchings M C 
K 2 n, with the following bad events (identical to the setup in f2l): For every pair of edges e, / of the same 
color, Eef occurs if {e, /} C M. If no bad event E^f occurs then M is a rainbow matching. We also 
define the following dependency graph: Ef,f ~ Ee'f' unless e,f,e',f' are four disjoint edges. Note that 
this is more conservative than the dependency graph we considered in Section 13.31 where two events are 
only connected if they do not form a matching together. The more conservative definition will simplify our 
analysis. In any case, our resampling oracle is consistent with this lopsidependency graph in the sense that 
resampling E^f can only cause new events Ef,i fi such that E^j ~ E^,/p. We show that this setup satisfies 
the criteria of the cluster expansion lemma. 

Let q = ^n, p = (^ 2 n-i){ 2 n- 3 ) y ~ Consider the neighborhood of a bad event T{Eef). It 

contains all events Epp such that there is some intersection among the edges e, /, e', f. Such events can 
be partitioned into 4 cliques: for each vertex r; G e U /, let denote all the events E^.!fi such that v ^ e' 
and /' has the same color as e'. The number of edges e' incident to r; is 2n — 1, and for each of them, the 
number of other edges of the same color is by assumption at most q — 1. Therefore, the size of is at most 
{q - l)( 2 n - 1 ). 

In the following, we use the short-hand notation = Oig/ Vi- Consider the assumptions of the cluster 
expansion lemma: for each event E^f, we should have 

__ 

Z^/Cr+(Ee/),/€lnd y^ 

We have Pr[i?e/] = p = ( 2 n-i)^( 2 n- 3 ) • symmetry, we set all the variables pef to the same value, 
Vef = y = Note that an independent subset of r+(£'e/) can contain at most 1 event from each clique 

Q^,. (The event Ef,f itself is also contained in these cliques.) Therefore, 


P^Eef] < 


/< 2 /e'/') <(! + (?- 


The reader can 
Therefore, 


/Cr+(£;ej-),/€lnd vGeUf 

verify that ^^cr+(i5,^),/€ind 2/^ < {l + {q-l){ 2 n-l)yf 


Z)/Cr+(£;ey),/€lnd y^ 


> p 


< 


l){2n-l)y)\ 

(l + in2(|)V(2n)2)4 = 
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which is the assumption of Theorem 15.441 By Theorem I5.44[ MaximalSetResample with the resampling 
oracle for matchings and the dependency graph defined above will find a rainbow perfecf mafching in time 
0(E E,f VefYl E,f + Vef)) = C>((E E,f Veff) with high probability. The number of bad events E^f 
is O(n^), because each color class has 0(n) edges so the number of edge pairs of equal color is O(n^). We 
have i/ef = 0(l/n^), and hence the total number of resamplings is 0{n^) with high probability. □ 

4.3 Latin transversals 

A Latin transversal in an n x n matrix A is a permutation tt £ Sn such that the entries Aj (“colors”) are 
distinct for z = 1, 2,... , n. In other words, it is a set of distinct entries, exactly one in each row and one in 
each column. It is easy to see that this is equivalent to a bipartite version of the rainbow matching problem: 
Aij is the color of the edge (i,j) and we are looking for a perfect bipartite matching where no color appears 
twice. It is a classical application of the Lovasz Local Lemma that if no color appears more than times 
in A then there exists a Latin transversal |[T 6 | . An improvement of this result is that if no color appears 
more than times in A then a Latin transversal exists [ 8 ]; this paper introduced the “cluster expansion” 
strengthening of the local lemma. (Note that ^ = fr-) These results were made algorithmically efficient 
by the work of Harris and Srinivasan 121 ■ 

Beyond finding one Lafin fransversal, one can ask whefher fhere exisf multiple disjoinl Latin fransver- 
sals. A remarkable exisfenfial resulf was proved by Alon, Spencer and Tefali fTll: If n = 2 ^ and each color 
appears in A af mosf en limes (e = in Iheir proof), Ihen A can be parlilioned info n disjoinl Latin 

Iransversals. Here, we show how lo find a linear number of Latin Iransversals algorilhmically. 

7 ? 7 ? 

Theorem 4.4. For any nxn matrix A where each color appears at most times, there exist at least 
disjoint Latin transversals, and they can be found in O(n^) resampling oracle calls w.h.p. 

We note lhal again, if fhere is conslanl mulliplicalive slack in Ihe assumplion on color appearances, 
the number of resamplings improves to O(n^). This also implies Theorem 14.31 as a special case: For an 
edge-coloring of K 2 n where no color appears more than times, let us label the vertices arbitrarily 
(ui ,... ,Un,vi,... ,Vn) construct a matrix A where Aij is the color of the edge {ui,Vj). If no color appears 
more than |^n times, by Theorem 14.41 we can find Lafin Iransversals; fhese correspond fo rainbow 
malchings in K 2 n- 

Our approach lo proving Theorem 14.41 is similar lo Ihe proof of Theorem 14.11 sample |g-n indepen- 
denlly random permufalions and hope fhal Ihey will be (a) disjoinl, and (b) Lafin. For reasons similar lo 
Theorem 14.11 Ihe local lemma works ouf and our framework makes fhis algorifhmic. 

Proof. Lei t = and lei vri,... jVr^ be independenlly random permufalions on [n]. We consider Ihe 
following Iwo types of bad evenls: 

• Elj,: For each i G [f] and e = {u,v), f = {x, y) £ [n] x [n] such fhal u v,x y, Auv = A^y, the 
event E''^^ occurs if 'Ki{u) = v and 7 rj(x) = y, 

• Elj-. For each i j £ [t] and e = (u, v) £ [n] x [n], the event Elj occurs if 7rj(n) = 'Kj{u) = v. 

Clearly, if none of these events occurs then the permutations tti ,..., vr^ correspond to pairwise disjoint Latin 
transversals. The probability of a bad event of the first type is Pr[£'*y] = and the probability for the 

second type is Pr[£^e'^] = Thus the probability of each bad event is at most p = . 
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It will be convenient to think of the pairs e = (x, y) E [n] x [re] as edges in a bipartite complete 
graph. As we proved in Section [XU the resampling oracle for permutations is consistent with the following 
lopsidependency graph graph. 

• ~ ^e'f whenever there is some intersection between the edges e, / and e', /'; 

whenever there is some intersection between e' and e, /; 

• E^J ~ £'*■? , E^J whenever e' intersects e. 

By LemmajXU the resampling oracle for a given event never causes a new event except in its neighborhood. 

Let us now verify the cluster expansion criteria. The counting here is quite similar to the proof of 
Theorem 14.11 so we skim over some details. The neighborhood of each event consist of 8 cliques: 4 
cliques of events of type £"*'/' ^ cliques of events of type E]^, corresponding in each case to the 4 

vertices of e U /. In the first case, each clique has at most n{q — 1) events, determined by selecting an 
incident edge and another edge of the same color. In the second case, each clique has at most n{t — 1) 
events, determined by selecting an incident edge and another permutation. 

The neighborhood of each event Ee also consists of 8 cliques: 4 cliques of events -Eg/j/ or T^g/j/, 
corresponding to the choice of either i or j in the superscript, and one of the two vertices of e. The size of 
each clique is at most re(q — 1), determined by choosing an incident edge and another edge of the same color. 
Then, we have 4 cliques of events E'"^, or E'‘J, determined by switching either i' or j' in the superscript, 
and choosing one of the vertices of e. The size of each clique is at most n{t — 1), determined by choosing 
an incident edge and a new permutation in the superscript. 

As a consequence, the cluster expansion criterion here is almost exactly the same as in the case of 
Theorem 14. II ^ 

^ “ (1 + re(f - l)y)‘^(l + n{q - l)y)^ ' 

We have p = here and we set y = j5p. For t,q < yre, it’s enough to satisfy > 1, which 

is achieved by /3 = (|)® and 7 = Therefore, Theorem 15.441 implies that MaximalSetResample will 
terminate within 0((X] ulf + X] = 0{n^) resampling oracle calls with high probability. □ 

5 Analysis of the algorithm 

Here we provide the analysis of our algorithm and the proofs of our main theorems. In Section 15.11 we 
begin with the basic notions necessary for our analysis and a coupling argument which forms the basis of all 
our algorithmic results. In Section [XU we prove a weaker form of Theorem 1 1.2 1 under the assumption that 
the (IGLLI) criterion holds with some slack. In Section [531 we introduce the independence polynomial of a 
graph and summarize its fundamental properties that are important for our analysis. In Section [531 we prove 
that our algorithm is efficient if Shearer’s criterion is satished with an e slack. In Section 15.51 we show that 
in some sense this assumption is not necessary, because every point satisfying Shearer’s criterion has some 
slack available, and we quantify how large this slack is. Finally, we return to the weaker (but more practical) 
variants of the local lemma: the (IGLLl) and (ICLLl) criteria. We present new combinatorial connections 
between these criteria and Shearer’s criterion, which in turn imply our main results on the efficiency of our 
algorithm under the (IGLLl) and (ICLLl) criteria (in Sections [53] and [XU respectively). 
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5.1 Stable set sequences and the coupling argument 

An important notion in our analysis is that of stable set sequences. We note that this concept originated in 
the work of Kolipaka and Szegedy |[24]| which builds on Shearer’s work 13611 . There are some similarities 
but also differences in how this concept is applied here: most notably, our stable set sequences grow forward 
in time, while the stable set sequences in flM grow backward in time (which is similar to the Moser-Tardos 
analysis lISOll '). 

Definition 5.1. One execution of the outer repeat loop in MaxinialSetResample is called an iteration. For a 
sequence of non-empty sets I = (Ii,..., f), we say that the algorithm follows Z if Ig is the set resampled 
in iteration s for 1 < s < f, and f is a set of the first m events resampled in iteration tfor some m > 1 (a 
prefix of the maximal independent set constructed in iteration t). 

Recall that Ind = Ind(G) denotes the independent sets (including the empty set) in the graph under 
consideration. 

Definition 5.2. Z = (/i , / 2 ,..., It) is called a stable set sequence ifli,... ,It ^ Ind(G) and Ig+i Z r+(/s) 
for each 1 < s < t. We call the sequence Z proper if each independent set Ig is nonempty. 

Note that if /^ = 0 for some s, then It = il) for all t > s. Therefore, the nonempty sets always form a 
prefix of the stable set sequence. Formally, we consider an empty sequence also a stable set sequence, of 
length 0. 

Lemma 5.3. If MaximalSetResample follows a sequence J = (Ji,..., Jt), then J is a stable set sequence. 

Proof. By construction, the set Jg chosen in each iteration is independent in G. For each i G Jg, we execute 
the resampling oracle r*. Recall that r* executed on a satisfied event Et can only cause new events in the 
neighborhood r+(i) (and this neighborhood is not explored again until the following iteration). Since Jg is 
a maximal independent set of satisfied events, all the events satisfied in the following iteration are neighbors 
of some event in Jg, i.e., Jg+i C r+( J^). In the last iteration, this also holds for a subset of the resampled 
events. □ 

We use the following notation: For i G [n]. Pi = PTf,[Ei]. For S' C [n], = 0 i^sPi- ^ Stable set 

sequence Z = (Ii,..., f), px = 0^=1 We relate stable set sequences to executions of the algorithm by 
the following coupling argument. Although the use of stable set sequences is inspired by ll^ . their coupling 
argument is different due to its backward-looking nature (similar to If30]l ). and their restriction to the variable 
model. 

Lemma 5.4. For any proper stable set sequence Z = (Ii, I 2 ,..., It), the probability that the MaximalSet¬ 
Resample algorithm follows Z is at most px. 

Proof. Given Z = {Ii,l 2 ,..., It), let us consider the following “X-checking” random process. We start 
with a random state a; ~ /x. In iteration s, we process the events of Ig in the ascending order of their indices. 
For each i G Ig, we check whether ui satisfies Ep, if not, we terminate. Otherwise, we apply the resampling 
oracle r* and replace io by ri{uj). We continue for s = 1,2,... ,t. We say that the X-checking process 
succeeds if every event is satisfied when checked and the process runs until the end. 

By induction, the state io after each resampling oracle call is distributed according to p: Assuming this 
was true in the previous step and conditioned on Ei satisfied, we have uj ~ p\Ei- By assumption, the re¬ 
sampling oracle rj removes this conditioning and produces again a random state rj(a;) ~ p. Therefore, 
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whenever we check event Ei, it is satisfied with probability Pr^[ii^j] (conditioned on the past). By a tele¬ 
scoping product of conditional probabilities, the probability that the X-checking process succeeds is exactly 
Ul=i riie/. = Ui=iP^‘ = PI- 

To conclude, we argue that the probability that MaximalSetResample follows the sequence 1 is at most 
the probability that the X-checking process succeeds. To see this, suppose that we couple MaximalSet¬ 
Resample and the X-checking process, so they use the same source of randomness. In each iteration, if 
MaximalSetResample includes i in Jt, it means that Ei is satisfied. Both procedures apply the resampling 
oracle r/(a;) and by coupling the distribution in the next iteration is the same. Therefore, the event that 
MaximalSetResample follows the sequence X is contained in the event that the X-checking process succeeds, 
which happens with probability px- □ 

We emphasize that we do not claim that the distribution of the current state a; € is /r after each resam¬ 
pling oracle call performed by the MaximalSetResample algorithm. This would mean that the algorithm is 
not making any progress in its search for a state avoiding all events. It is only the X-checking process that 
has this property. 

Definition 5.5. Let Stab denote the set of all stable set sequences and Prop the set of proper stable set 
sequences. Let us denote by Stab^ the set of stable set sequences (Ii,..., Ii) of length I, and by Stab^(J) 
the subset o/Stab^ such that the first set in the sequence is J. Similarly, denote by Prop^ the set of proper 
stable set sequences of length I, and by Prop(J) the subset o/Prop such that the first set in the sequence is 
J. Fori = (Li,..., It) G Prop, let us call cr(I) = Yl^s=i total size of the sequence. 

Lemma 5.6. The probability that MaximalSetResample runs for at least i iterations is at most X^xeProp^ Pi- 
The probability that MaximalSetResample resamples at least s events is at most 'f2i^Prop-a{i)=sPi- 

Proof. If the algorithm runs for at least £ iterations, it means that it follows some proper sequence X = 
(Ii, I 2 ,..., Ii). By Lemma [ 5 ^ the probability that the algorithm follows a particular stable set sequence X 
is at most px. By the union bound, the probability that the algorithm runs for at least £ iterations is at most 

J2x={h,...,le)ePmpPl- 

Similarly, if the algorithm resamples at least s events, it means that it follows some proper sequence X 
of total size cr(X) = s. By the union bound, the probability of resampling at least s events is upper-bounded 
'^XGProp:a{X)=sPl- 

We note that these bounds could be larger than 1 and thus vacuous. The events that “the algorithm 
follows X = {h,... ,Ie)” are disjoint for different sequences of fixed total size cr(X), while they could 
overlap for a fixed length £ (because we can take Ii to be different prefixes of the sequence of events 
resampled in iteration t). In any case, the upper bound of px on each of the events could be quite loose. 

5.2 A simple analysis: the General Lovasz Lemma criterion, with slack 

In this section we will analyze the algorithm under the assumption that the (IGLLI) criterion holds with some 
“slack”. This idea of exploiting slack has appeared in previous work, e.g., OOl [T3l l20l 1^ . This analysis 
proves only a weaker form of Theorem [T2] The full proof, which removes the assumption of slack, appears 
in Section [ 5 ^ 

To begin, let us prove the following (crude) bound on the expected number of iterations. We note that 
this bound is typically exponentially large. 
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Lemma 5.7. Provided that the pi satisfy the (IGLLI) criterion, pi < Xi njer(i)^^ ~ have 

n 

E px fi rir^' 

I — Xi 

iGProp 1=1 

Proof. It will be convenient to work with sequences of fixed length, where we pad by empty sets if necessary. 
Note that by definition this does not change the value of px'- e.g., ~ P{h,i2,^,-,9)- Recall that 

Stab£(J) denotes the set of all stable set sequences of length i where the first set is J. We show the 
following statement by induction on £: For any J € Ind and any £ > 1, 

E K £ Dr^- w 

ISStab^f J) 


_ — Xi 

jeJ ^ 


This is true for £ = 1, since p(j) = p'^ < OjeJ LLL assumption. Let us consider the 

expression for £ + 1. We have 


PI' = ^ p'^ U 

Z'GStab^+l(J) J'Cr+(J)XGStab£(J') J'Cr+(J) iGJ' 

by the inductive hypothesis. This can be simplified using the following identity: 

n (!+"*)= Yi ii"*' 

iGr+(J) /iCr+(J)iGF 

We use this with a* = -r-^. Therefore, 

i Xi 


Xi 


1 - Xi 


pi'<p' n (i+T^) =^■'11 


I'£Stab£^_l(J) 

Now we use the LLL assumption: 


ter+(j) 


1 


iGr+(J) 


1 - Xi 


(5) 


/ = 


ieJ iGJ \ ier(i) / ieJ jer+(j)\j 

because each element of r+( J) \ J appears in r(z) for at least one i ^ J. We conclude that 


E pr < u^i n (1-^^) n = Hi 

X'GStab^(J) ieJ jGr+(J)\J i'Gr+(J) * ■ ■ 

This proves dH). 

Adding up over all sets J C [n], we again use dD to obtain 


leJ 


E pxs Enr^-n^-i^ - Hr 

JC[n]iGJ ^ i=l ^ 


XGStab^ 

As we argued above, this can be written equivalently as 

i n 


i=l 


- Xi 


E E ^ n I 


1 


- Xi 


/c=lXGProp i=l 

Since this is true for every £, and the left-hand-side is non-increasing in £, the sequence as £ —)■ oo has a 
limit and the bound still holds in the limit. □ 
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The following is our first concrete result: our algorithm is efficient if (IGLLI) is satisfied with a slack. 


Theorem 5.8. If (IGLLI) is satisfied with a slack of e, i.e. 

Pr[-Ej] < (1 - e)xi n (1 - Xj) 

^ iGr(i) 

then with probability 1 — MaximalSetResample resamples at most + 'Y^=i 

Proof By Lemma [531 the probability that MaximalSetResample resamples more than s events is at most 
Z^lGProp: a{X)=\s \ Pi where px is the product of pi = Pr^[i?i] over all events in the sequence X. By the 

slack assumption, we have pi < {1 — e)p\ and px < (1 — where p' = Xi njgr(i)(l “ ^i)- Using 

Lemma 1531 we obtain 


Y. n< (i-e)' E A ^ 

XeProp 2=1 


X^Prop 

o'(2^)=M 


l-Xi 


For s = + Ya=i In rxF)’ obtain 


E K £ e-n 


XeProp 

a{I)=\s^ 


2=1 


1 - Xi 


< e"^ 


Therefore, the probability of resampling more than s events is at most e 


□ 


5.3 Preliminaries on Shearer’s criterion 

In this section we discuss a strong version of the local lemma due to Shearer 13^ . Shearer’s lemma is based 
on certain forms of the multivariate independence polynomial. We recall that p^ denotes Hie/ Pi- 

Definition 5.9. Given a graph G and values pi,..., p„, define for each S C [n] 


qs = qs{p) = 


( 6 ) 


/Gind 

SCI 


Note that = 0 for S' ^ Ind. An alternative form of these polynomials that is also useful is obtained 
by summing over subsets of S'. 

Definition 5.10. Given a graph G and values pi,... ,Pn, define 

(p) = Y (-i)'^'p^- 


qs = qs[p) = 


/eind 

ICS 


The following set plays a fundamental role. 

Definition 5.11. Given a graph G, the Shearer region is the semialgebraic set 

S = { p G (0,1)"' : VI € Ind, qi{p) > 0 } 

= { p G (0, 1)*" : yS C [n], qs{p) > 0 } 


(7a) 

(7b) 
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The equivalence between (TTab and (iTbl) is proven below in Claim [5T9l 
Shearer’s Lemma can be stated as follows. 

Lemma 5.12 (Shearer Obi ). Let G be a lopsidependency graph for the events Ei,, En. Let pi = 
Vi^[Ei] € (0, 1). Ifp € S then Pr^[n"=i Ei] > Qd,. 

It is known that Shearer’s Lemma implies Theorem [TT] as we will see in Section |5^ and in fact gives 
the tight criterion under which all events can be avoided for a given dependency graph G. The polynomials 
qs{p) and qs{p) have a natural interpretation in the Shearer region: There is a “tight instance” where qs{p) 
is the probability that the set of occurring events is exactly S, and qs{p) is the probability that none of the 
events in S occur. In particular, 50 (p) = q[n] (p) is exactly the probability that no event occurs. (See Obi for 
more details.) 

5.3.1 Properties of independence polynomials 

In this section we summarize some of the important properties of these polynomials, most of which may be 
found in earlier work. Since some of the proofs are not easy to recover due to different notation and/or their 
analytic nature (in case of |[35l ). we provide short combinatorial proofs for completeness. 

Claim 5.13 (The “fundamental identity”. Shearer Obi . Scott-Sokal 051 Eq. (3.5)]). For any a £ S, we have 

QS = QS\{a} - Pa ■ qs\r+{a)- 

Proof. Every independent set / C 5 either contains a or does not. In addition, if a € / then L is independent 
iff L \ {a} is an independent subset of S' \ r+(a). □ 

Claim 5.14 (Shearer Obi . Scott-Sokal 051 Eq. (2.52)]). For every S C [n], 

qs = qr- 

yc[n]\s 

Proof. By definition of qy, 

E w = E E (-1)''"'/' = E !>' E (-1)'''*''. 

yc[n]\5 yc[n]\s.^^nd /Gind YCI\S 

If I\S 0 then the last alternating sum is zero. Therefore, the sum simplifies fo ~ 

as required. □ 

Claim 5.15 (Shearer Obi ). 

JGind S'C[n] 

Proof. Sef S = 0 in Claim I5T4I and use fhe facf fhaf % = 1. □ 

Claim 5.16 (Scoff-Sokal 051 Eq. (2.48)]). For / G Ind, 

qi = p^ ■ q[n\\r+(l)- 
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Proof. Given I G Ind, each independent set J ^ I can be written uniquely as J = I L) K where K is 
independent and K n r'^(I) = 0. So, 

< 11 = E ^ 

Jelnd:/CJ ireind 

KQ[n]\T+(I) 


□ 


Lemma 5.17 (Kolipaka-Szegedy ll^ Lemma 15]). For any I G Ind 

qi = (is- 


scr+{7) 

Proof. By Claim 15 .161 and Claim 157141 we have qj = ■ q[n]\r+{i) = Z]scr+( 7 ) 

Claim 5.18 (Simultaneous positivity of qs and qs). Assume that p G [0, l]"^. Then 


□ 


qi >0 VI G Ind qs > q% VS" C [n] (8) 

qs >0 yS C [n] qi> VI G Ind. (9) 

Proof. (I8]l follows from Claim |5T4| (since qy = 0 for L ^ Ind). To see (|9ll, first note that g/ > 0 for all 
I G Ind, by Claim [5C6l Consequently, by Claim l5T^ qy^] = mins qs- Clearly, = min/ p^ . It follows 
from Claim [ 5 TT 6 ] again that qj = p^ - g[n]\r+( 7 ) > ' 9[n]- D 

Claim 5.19. The two characterizations of the Shearer region, (17^ and (I7bl) . are equivalent. 

Proof. By Claim l5T8l if g 0 > 0 and qs > O'^S C [n], then qs > 0 for all 5 C [n]. Conversely, if gs > 0 
for all S C [n], then g/ > p^’^^gjn] > 0 for all I G Ind. □ 

Claim 5.20 (Monotonicity of g, Scott-Sokal |[35l Theorem 2.10]). Let p G [0,1]”. 

qs{p) > 0 VS C [n] ^ gs(p') > qs{p) VO < p' < p, VS C [n]. 

Proof. First consider the case that p and p' differ only in coordinate i. For any S C [n]. Claim l5T3] implies 

Wi^s{p) = -qs\T+{{){p) and ^qs = 0. Thus, 

qs{p') = qs{p) + {pi -p'i) ■ qs\r+ii){p) > qs{p)- 

The case that p' and p differ in multiple coordinates is handled by induction. □ 

Claim 5.21 (Log-submodularity of qs, Scott-Sokal ll35l Corollary 2.27]). For any p £ S and A,BC [n], 
we have qA-qB> qAuB ■ qAnB- 

Proof. We claim that for any a £ S C T, v/e have 


qs ^ qr 

qS\{a} ~ qT\{a} 


( 10 ) 


By induction, this implies that for any R F S, 
T = AU B, and R = A \ B. 


QS > <7t 

Qs\r ~ Qt\r ' 


We obtain the claim above by setting S 


21 , 
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We prove (fTOl) again by induction, on |r|. For |T| = 1, the statement is trivial. Let \T\ > 1. By 
Claim [ 5 TT 3 I we have 

QS = QS\{a} - Paqs\r+{a) 

and 


qr = qT\{a} - PaqT\r+{a)- 

Let US denote S D r+(o) = {a, si,..., s^}. We apply (fTOl) to strict subsets of S and T, to obtain 

w k ^ few w w 

qs\r+{a) ^ T-r qs\{a,si,...,si.usi} ^ -rr gr\{a, 6 i,..,s,_i,^i} ^ qT\{Snr+{a)) ^ gT\r+(a) 

qS\{a} qS\{a,su-,Si_i} ~ qT\{a,si,-,Si-i} qT\{a} ~ gT\{a} 

where in the last step we used the monotonicity of qt in T (again from Claim I5TT3] ). This implies (ITOl ): 


qs _ , ^ qs\r+(a) ^ ^ gT\r+(a) _ qr 

^ Pa ^ Pa ^ ^ 

qs\{a} 9s\{a} 9T\{a} ^T\{a} 


□ 


Claim 5.22 (Log-submodularity of qs). For any p € S and A,BC [n], we have qA ■ qs > dAuB ■ gAns- 
Proof We can assume Au B ^ Ind; otherwise the right-hand side is zero. By Claim ISTThl we have 

qA-qB = P^qin]\r+{A) ■ P^q[n]\r+{B)- 

By Claim|5]2T] 

g[n]\r+(A) ■ g[n]\r+(s) > g[n]\(r+(A)ur+(s)) • g[n]\(r+(A)nr+(B))- 
Here we use the fact that r+(A) U r+(H) = r+(A U B), and r+(A) n r+(H) D r+(A n B). Therefore, 
by the mono tonicity of qs, 


g[n]\r+(A) • g[n]\r+(B) > g[n]\r+(AuS) ‘ g[n]\r+(AnB) • 

Also, = p^uB^Ans Using Claim l5T^ one more time, we obtain 

qA-qB > P^'^^q[n]\r+{AUB) ■ P^'^^q[n]\r+{AnB) = gAuB-gAnB- 


□ 


Claim 5.23. Suppose that p G S. For any set S C [n], 


n(i + 


g0 


JCS j€S 

Proof. The proof is by induction on S, the case IS"! < 1 being trivial. Fix any s € S. Claim |5]22 implies 
that qj^s ■ (70 < • qj for any J C 5 \ {s}. Summing over J yields 


E 


qj+s ^ q{s} 


gJ 


JCS\{s} JCS\{s} 


Adding E jcb\{s} ^ 


E!: s (1 + 


m 


JCS 


E 

JCS\{s} 


gj 


The claim follows by induction. 


□ 
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Claim 5.24. //or > 0 then ^ - 1. 

■’ 90 <?[«] 

Proof. By Claim [5TT41 

1 + ^ 

90 90 


gM\{»} 

9 [n] 


□ 


Claim 5.25 (Kolipaka-Szegedy Il24l Theorem 5]). ^(1 + e)p G S then < \for each i G [n]. 

Proof. Note that q[n]\{i}{p) does not depend on pi, while q[n]{p) is linear in pi. Also, both quantities are 
equal at pi = 0: we have q[n]{pi, ■ ■ ■ ,0 ■ Pi, - ■ ■ ,Pn) = q[n]\{i}{p)- Since (1 + e)p G S, we know that 
q[n\{pi, ■ ■ ■, (1 + e)Pi, • • • ,Pn) >0. By linearity, q[n\{p) > j^q[n]\{i}{p)- Claim 15.241 then implies that 


5.3.2 Connection to stable set sequences 

Kolipaka and Szegedy showed that stable set sequences relate to the independence polynomials qs. The 
following is the crucial upper-bound for stable set sequences when Shearer’s criterion holds. In fact, this 
result is subsumed by Lemma [5. 27 1 but we present the upper bound first, with a shorter proof. 

Lemma 5.26 (Kolipaka-Szegedy ijldl l. If qs > D for all S C [n] and q^ > 0, then 

Px < — VJ G lnd,V£ > 1 . 

0(7) 

XGStab^(J) 


Proof. We proceed by induction: for £ = 1, there is only one such stable set sequence X = (J). By 
Lemma |5.17[ we have qj = P'^ Ylscr+{j) > 7''^90- (Recall that > 0 for all S C [n].) Hence, 

P{j) =p-^ < 9j/90- 

The inductive step: every stable set sequence starting with J has the form I = (J, J',. • •) where 
J' C r+(J). Therefore, 

X] PI = X] X] PI- (11) 

leStab^fJ) J'gind l£Stab^_i (J') 

j'cr+(j) 

By the inductive hypothesis, XiGStab^_i(j') Pi- — 9J'/90- Also, recall that qj' = 0 if J' ^ Ind. Therefore, 


^ J sr^ qj' qj 
> PI<P > — = — 

leStabAJ) J'cr+(j) ® 


using Lemma B.ni to obtain the last equality. □ 

The inequality in Lemma B.261 actually becomes an equality as £ ^ oo, as shown in Lemma B.271 This 
stronger result is used only tangentially in Section B. 7. 21 but we provide a detailed proof in order to clarify 
the arguments of Kolipaka and Szegedy |[24l . 

Lemma 5.27 (Kolipaka-Szegedy lEH Theorem 14]). For a dependency graph G and pi,... ,pn G (0,1), 
the following statements are equivalent: 


1. 50 > 0 and qs > Ofor all S C [n]. 
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2. for all J e Ind, gj > 0 W X;i 6 Prop(j) Pi = qj/qfD- 

3. px is finite for each J G Ind. 

Proof First, note that Prop(J) = IJSi |Jf=i be identified with Stab£(J), 

since each proper sequence Z of length at most I can be padded with empty sets to obtain a sequence 
in Stab^(J) (and px does not change). Therefore, ExeProp(j)Pi = bm^^oo ExGStab^(j) Pi- This is a 

non-decreasing sequence; the limit exists but could be infinite. Let us denote Wj^ = X^igstab^(j)Pi 
wy = lim£_^cx 3 = Yx£Prop(j) Pi- define M to be the following linear operator on 

{Mx)i =p^ ^ xj. 

Jelnd 

jcr-^(/) 


Using this notation, the identity (fTTI) can written compactly as Inductively, = 

and w* = lim^^oo 

1^2: Assume now that gs > 0 for all S C [n] and > 0. Lemma 15.261 proves that this implies 

'W*J = ExGProp(j)Pi = lim^^oo ExeStabAJ)Pi ^ Clearly ExGProp(j)Pi > 0> so this also implies 

that qj > 0 for all J G Ind. 

Note that is the column of M corresponding to J = 0: M/ 0 = p^ for each / G Ind. Therefore, we 
can write = Mw^^\ where = e 0 is the canonical basis vector in corresponding to 0. We have 
w* = lim^_).oo = lim£_^oo We may subtract these two limits since we have shown that 

every w*j is finite, obtaining lim£_).oo = 0. We note that has strictly positive 

coordinates for / 7 ^ 0 , and 0 for / = 0 . 

By Lemma B.171 we have Mg = g for the vector g G with coordinates g/. Consider ^g — a 
nonnegative vector with 0 in the coordinate corresponding to 0. We can choose /3 > 0 large enough so that 
coordinate-wise, 0 < ^g — From this we derive that 


0 < —g — w* = lim f—g — tu® ) < fi lim = 0, 

g0 e^oo yg0 J e^oo 

so equality holds throughout. Recalling the definition of w*j, we conclude that EigProp{j) Pi = w*j = —qj. 

2 => 3: Trivial. 

3^1: Let p G (0, !)"■ be the vector (pi,... ,Pn)- We can assume that min^ qs{p) < 0, otherwise 
we are done by Claim HHH Let us consider the values of qs on the line { Xp : A G [0,1] }. Define A* = 
inf{A G (0,1] : min 5 g 5 (Ap) < 0}. We observe thaf min 5 g 5 (Ap) > 0 for 0 < A < 1/n, which can 
be verified direcfly by considering the alternating sum defining qs- (Intuitively, Shearer’s Lemma holds 
in this region just by the union bound.) Therefore, we have A* > 0. Furthermore continuity also implies 
ming qsiX*p) = 0, so Claim lSU^ vields g 0 (A*p) = g[„] {X*p) = 0. For A G [0, A*) we have min^ qsiXp) > 
0, so by Claim lSU^ we also have min/gind qi{Xp) > 0. This shows that the condition 1 holds at the point Xp, 
for A G [0, A*), so we may use the implication 1 2: EiGProp(j)('^P)i = qj{Xp)/q%{Xp). Let J G Ind 

be such that qj{X*p) > 0; such a J must exist by Claim l5TT5l By the monotonicity of px = O/gxP^ ’^be 
variables pi, ... we have 


PX > {X*p)x > bmmj Y 

ZGProp(J) XGProp(J) XGProp(J) 


liminfi#) 

A^A*- g0(Ap) 


00, 


as qj{X*p) > 0 but q%{X*p) = 0. This contradicts the assumption 3 that EieProp(j) Pi finite. □ 
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From Claim I 5 T 5 I we obtain immediately the following. 


Corollary 5.28. If qs > 0 for all S C [n] and q^) > 0, 

XeProp 


1 


Remark. An equivalent statement using the language of “traces” appears in the recent manuscript of Knuth 
Page 86, Theorem F], together with a short proof using generating functions. Furthermore, using 
Claim [ 5 TT 4 I we may derive 


E E J'x 

JCA iGProp(J) 


E 

JCA 


<1J_ 

% 


Q[n]\A 

Q[n] 


for any A C [n]. This statement, in the language of haces, also appears in Knuth’s draft 1231 Page 87, 
Equation (144)]. 


Summary at this point. By Lemma 1531 and Corollary 15.281 MaximalSetResample produces a state in 
Ei after at most 1/(70 iterations in expectation. However, this should not be viewed as a statement of 
efficiency. Shearer’s Lemma proves that Pr^[P/^^ Ei] > q^ so, in expectation, 1/(70 independent samples 
from fj, would also suffice fo find a sfate in 0 ?=! 

Section lS^ improves this analysis by assuming that Shearer’s criterion holds with some slack, analogous 
to the result in Section l5^ Section l53] then removes the need for that assumption — it argues that Shearer’s 
criterion always holds with some slack, and provides quantitative bounds on that slack. 


5.4 Shearer’s criterion with slack 

In this section we consider scenarios in which Shearer’s criterion holds with a certain amount of slack. To 
make this formal, we will consider another vector p' of probabilities with p < p' ^ S. Lor notational 
convenience, we will let q'g denote the value qs{p') and let qs denote qs{p) as before. Let us assume that 
Shearer’s criterion holds with some slack in the following natural sense. 

Definition 5.29. We say that p G (0,1)” satisfies Shearer’s criterion with coefficients q'g at a slack of e, if 
p' = {\ + e)p is still in the Shearer region S and q'g = qs{p')- 

Theorem 5.30. Recall that pi = Pr^[i7j]. If the pi satisfy Shearer’s criterion with coefficient q'^ at a slack 
of e ^ (0,1), then the probability that MaximalSetResample resamples more than In ^ + f) events is at 
most e“*. 

Proof By Lemma [531 the probability that MaximalSetResample resamples more than s events is at most 
ZlxeProp: ct(X)=[s] P^- slack assumption, we have 

Pr [resample more than s events] < PI < (1 + e) * X] P'l 

X^Prop XGProp 

<^{P)=\s\ <t{X)=\s\ 

since we have p' = (1 + e)pi for each event appearing in a sequence I. The hypothesis is that the 
probabilities p' satisfy Shearer’s criterion with a bound of Consequently, Corollary I5.28l implies that 
EieProp:^(i)=pi Px < EiePropPi < Thus, for s = f (In ^ + t) we obtain 

Pr [resample more than s events] < (l + e)“®-r < = e“*. 

^0 ^0 ^0 
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□ 


In other words, the probability that MaximalSetResample requires more than | ln(l/q 0 ) resamplings 
decays exponentially fast; in particular the expected number of resampled events is . This 

appears significantly better than the trivial bound of l/g 0 ; still, it is not clear whether this bound can be 
considered “polynomial”. In the following, we show that this leads in fact to efficient bounds, comparable 
to the best known bounds in the variable model. 


Corollary 5.31. If the pi satisfy Shearer’s criterion with coefficients q'g at a slack of e & (0,1), then the 
probability that MaximalSetResample resamples more than 



events is at most e *. 

Proof By Claim [5?T5] and Claim 15^2^ we have 



The result follows from Theorem 15.301 □ 


Next, we provide a simplified bound fhaf depends only on fhe amounf of slack and fhe number of evenfs. 
This is analogous fo a bound of 0(n/e) given by Kolipaka-Szegedy flM in fhe variable model. 


Theorem 5.32. Ifpi,... ,pn satisfy Shearer’s criterion at a slack ofe G (0,1), then the expected number of 
events resampled by MaximalSetResample is 0{j log ^). 


Proof Lefp' = (1 + ej2)p. By assumpfion, (1 + e/3)p' < (1 + e)p € S. Therefore, p' still has e/3 slack 
so by Claim l5^25l fhe coefficienfs q'g = qs{p') satisfy The poinf p safisfies Shearer’s criferion 

wifh coefficienfs q'g af a slack of e/2, so by Corollary 15 311 fhe probabilify fhaf we resample more fhan 
|(reln(l + -) + t) evenfs is af mosf e“*. In expecfafion, we resample 0{j log i) evenfs as claimed. □ 


5.5 Quantification of slack in Shearer’s criterion 

In fhe previous secfion, we proved a bound on fhe number of resamplings in fhe MaximalSefResample 
algorifhm, provided fhaf Shearer’s criferion is satisfied wifh a cerfain slack. In facf, from Definilion 15.Ill 
one can observe fhaf fhe Shearer region is an open set and Iherefore fhere is always a certain amounf of 
slack. However, how large a slack we can assume is nol a priori clear. In particular, one can compare wifh 
Kolipaka-Szegedy f24l where a bound is proved on fhe expected number of evenfs one has fo resample 
in fhe variable model: If Shearer’s criferion is satisfied wifh coefficienfs qs, then the expected number of 
resamplings is at most (!{%}/% I211- In this section, we prove that anywhere in the Shearer region, 
there is an amount of slack inversely proportional to this quantity, which leads to a bound similar to that of 
Kolipaka and Szegedy 12^ . 

Lemma 5.33. Let (pi,... ,pn) G (0,1)’^ be a point in the Shearer region. Let e = q%/i2 9{*}) 

p' = (1 + e)pi. Then (p'^,... ,p^) is also in the Shearer region, and q(/i{p') > \%{p)- 
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Before proving the lemma, let us consider the partial derivatives of the qs polynomials. 


Claim 5.34. For any i £ S, 


dqs 

~dii~ ~ 


and for any j G S \ r+(i), 


d^qs 

dpidpj 


For other choices of i,j, the partial derivatives 

^<0and > 0. 


dpi — 


dpidpj — 


- qs\r+{i)\r+ij)- 

are 0. In particular, for any point in the Shearer region, 


Due to Claim [534l we may say that qs{pi, ■ ■ ■ ,Pn) is “continuous supermodular” in the Shearer region. 


Proof For any i £ S, have qs = qs\{i} ~ Piqs\r+(i) by Claim [5TT^ The polynomials qs\{i} and 
qs\r+{i) do not depend on pi and hence is equal to —qs\r+(i)- Repeating this argument one more time 
for j G S’\r+(i), we get ^ = -qs\r+(i) = -qs\r+(i)\{j} + Pjqs\r+(i)\r+(j)- Again, qs\r+(i)\{j} and 
qs\r+(i)\r+(j) do not depend on pj and hence = qs\r+(i)\r+(j)- 

Clearly, we have = 0 unless i £ S, and = 0 unless i £ S and j £ S \ r+(i). Since all the 

coefficients ijs are positive in the Shearer region, we have < 0 and > 0 for alH, j. □ 

Now we can prove Lemma l5.33l 


Proof Consider the line segment from p = 

^ Ft < (1 + ^)P^ 

and Claim [S^ni Let us define 


(pi,... ,pn) to p' = (p'l,... ,p'„) where p' = (1 + e)pi, 

= Pi < 1 by Claim EH Claim EHl 

tt*} P*9[n]\r+(i) 


QttW = 90((i + a)pi,...,(i + a)p„). 
By the chain rule and Claim [5341 we have 


dQd^ 


dX 


A=0 


n rs 

dpi 

2=1 




n]\r+(i) 


2=1 


2=1 


0 


where we used Claim 15361 in the last equality. Assuming that (1 + A)p = ((1 + A)pi,..., (1 + A)p„) is in 
the Shearer region, we also have by Claim l534l 


d^Qm 

dA2 


E 


i,j=t 


d'^qa 

dpidpj 


PiPj > 0. 


That is, Q%{X) is a convex function for A > 0 as long as (1 + A)p is in the Shearer region. Our goal is to 
prove that this indeed happens for A £ [0, e]. 

Assume for the sake of contradiction that (1 + A)p is not in the Shearer region for some A G [0, e], 
and let A* be the minimum such value (which exists since the complement of the Shearer region is closed). 
By Claim |538l anywhere in the Shearer region, (70 = q^^] is the minimum of the qs coefficients; hence by 
continuity it must be the case that q[n]((l + A*)p) is the minimum coefficient among qs((l + A*)p) for all 
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S C [n], and ( 50 (A*) = g[n]((l + ^*)p) < 0. On the other hand, by the minimality of A*, Q 0 (A) is positive 
and convex on [0, A*) and therefore 


Q 0 (A*) > Q 0 (O) + A' 


dQih 


dX 


A=0 


= 




— e 






2=1 


2=1 


1 

2^0 > 


which is a contradiction. Therefore, <30(A) is positive and convex for all A G [0, e]. By the same computation 
as above, Q 0 (e) > □ 


This implies our main algorithmic result under Shearer’s criterion. 


Theorem 5.35. Let Ei,... ,En be events and let pi = Pr^[Sj]. Suppose that the three subroutines de¬ 
scribed in Section Vl . 1. 1\ exist. Ifp G S then the probability that MaximalSetResample resamples more than 
4 EILi ^ ( Ei=i ln(l + + l + t) events is at most e”*. 

We note that the corresponding result in the variable model flM was that the expected number of resam¬ 
plings is at most EILi Here, we obtain a bound which is at most quadratic in this quantity. 


Proof. Directly from Theorem 15.301 and Lemma 15.331 Given p in the Shearer region. Lemma 15 .33 1 implies 
that p in fact satisfies Shearer’s criterion with a bound of ^ at a slack of e = ^/Er=i ?{*}• Hy 

Theorem 15.301 the probability that MaximalSetResample resamples more than s events is at most e“*. 


where 


s = - In — + t 

e V «0 


< — 


2=1 


In — + 1 + t 
Qm 


Using Claim 15^231 we can replace In ^ by Ej=i ^^(1 + ^^)- 


□ 


5.6 The General LLL criterion, without slack 


Shearer’s Lemma (Lemma [5. 121) is a strengthening of the original Lovasz Local Lemma (Theorem 1 1.11) : if 
pi,... ,Pn satisfy (IGLLI) then they must also satisfy Shearer’s criterion p ^ S. Nevertheless, there does not 
seem to be a direct proof of this fact in the literature. Shearer |[3^ indirectly proves this fact by showing 
that, when p ^ 5 it is possible that Pr[P|”^^ Ei] = 0, so the contrapositive of Theorem 1 1.1 1 implies that 
(IGLLI) cannot hold. Scott and Sokal prove this fact using analytic properties of the partition function |[351 
Corollary 5.3]. In this section we establish this fact by an elementary, self-contained proof. 

We then establish Theorem 11.21 our algorithmic form of Theorem 11.11 in the general framework of 
resampling oracles. Unlike the simpler analysis of Section 15^ the analysis of this section does not assume 
any slack in the (IGLLI) criterion. 

Lemma 5.36. Suppose that p satisfies (IGLLI) . Then, for every S C [n] and a & S, we have 


qs\{a} 

Corollary 5.37 ( (IGLLI ) implies Shearer). Ifp satisfies (IGLLI ) then p ^ S. 


Proof. For any S' C [n], write itasS = {si,...,Sfc}. Induction yields 


m 


> 0- 

i=l 


The claim follows since q 0 = 1. 
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Corollary 5.38. If p satisfies (IGLLI) then . 

Proof Lemma B . 3 6 1 yields < ,f , so the result follows from Claim|5]25] □ 

y[n] ^cb 

Proof (of Lemma l5.36l) . We proceed by induction on |S'|. The base case, S' = 0, is trivial: there is no a G S' 
to choose. Consider 5 7 ^ 0 and an element a € S'. By Claim |5T3l we have qs = Qs\{a} — PaQs\r+{a)- By 
the inductive hypothesis applied iteratively to the elements of (S' \ {a}) \{S\ r+(a)) = r(a) fl S, we have 

Qs\{a} > Qs\r+(a) n 

iGr(a)n5 

Therefore, we can write 

. . A Pa 

QS = QS\{a} - Paqs\r+{a) > QS\{a} 1 “ ^ 

V lliGr(a)nsl-‘- 

By the claim’s hypothesis, pa < Xa niGr(a)(l “ ^ nier(a)ns(l “ conclude that qs > 

(1 - Xa)g5\{a}- □ 

These results, together with our analysis of Shearer’s criterion with slack (Corollary 15.311 ). immediately 
provide an analysis under the assumption that (IGLLI) holds with slack, similar to Theorem 15.81 However, 
this connection to Shearer’s criterion allows us to prove more. 

We show that our algorithm is in fact efficient even when the (IGLLI) criterion is tight. This might 
be surprising in light of Corollary 15.281 which does not use any slack and gives an exponential bound of 
^ ^ < nr=i tA”- reason why we can prove a stronger bound is that Shearer’s criterion is never 

tight, as we argued already, it defines an open sef, and Section [53] derives a quantitative bound on the slack 
that is always available under Shearer’s criterion. 

Theorem 5.39. Let Ei,..., En be events and letpi = Pr^[L^j]. Suppose that the three subroutines described 
in Section 17. i. i I exist. If p satisfies (IGLLI) then the probability that MaximalSetResample resamples more 
than 4 tAt + 1 + f) events is at most e~^. 

If (IGLLI ) is satisfied with a slack of e G (0,1), i.e., (1 + e)pi < Xi njGr(i) “ ^j)’ probability 

at least 1 — e“*, MaximalSetResample resamples no more than i-x events. 

Proof. The first part follows directly from Theorem 15.351 since Corollary 15.371 shows that p ^ S and 
Corollary 15.381 shows that ^ - At -. The second part follows from Corollary 15.311 using again that 

Xj Q 

Q0 1 Xi 

Theorem [L2] follows immediately from Theorem [5391 

5.7 The cluster expansion criterion 

Recall that Section 11.41 introduced the cluster expansion criterion, which often gives improved quantitative 
bounds compared to the General LLL (such as the applications discussed in Section[4|). For convenience, let 
us restate the cluster expansion criterion here. Given parameters yi,..., define the notation 

ys=^y' vs C [n]. 

ICS 

IClnd 
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The cluster expansion criterion for a vector p G [0,1]”, with respect to a graph G, is 

32 /1 ,..., yn > 0 such that Pi < yi/>r+(i)- (CLL) 

This criterion was introduced in the following non-constructive form of the LLL. 

Theorem 5.40 (Bissacot et al. f8|). Let Ei,..., be events with a (lopsi-)dependency graph G, and let 
Pi = Pr^[ii^i]. Ifp and G satisfy (ICLLI) then Pr^[niLi ^i] > 0- 

To see that this strengthens the original LLL (Theorem (LT]), one may verify that (IGLLI) implies (ICLLI) : 
if Pi < Xi njer(i)(^ “ Hi — ^ — Xi = j^) and then use the simple bound 

^ = n + 

/cr+(i) /cr+{i) ier+(i) 

/eind 

On the other hand, Shearer’s Lemma (Lemma 15.121) strengthens Theorem 15.401 in the sense that (ICLLI) 
implies p € S. This fact was established by Bissacot et al. [Si] by analytic methods that relied on earlier 
results ifT/l . In this section we establish this fact by a new proof that is elementary and self-contained. 

An algorithmic form of Theorem 15.401 in the variable model was proven by Pegden 1331. In fact, that 
result is subsumed by the algorithm of Kolipaka and Szegedy in Shearer’s setting, since (ICLLI) implies 
p G 5. In this section, we prove a new algorithmic form of Theorem 15.401 in the general framework of 
resampling oracles. 

To begin, we establish the following connection between the pi parameters and the qs polynomials. For 
convenience, let us introduce the notation S''^ = [n] \ S', S' + a = S' U {a} and S — a = S \ {a}. 

Lemma 5.41. Suppose that p satisfies (ICLLI) . Then, for every S C [n] and a € S, we have 

Qs ^ 

QS-a ~ Y{S-aT 

The proof is in Section [5.7.1l below. 

Corollary 5.42 ( (ICLLI) implies Shearer). Ifp satisfies (ICLLI) then p ^ S. 

Proof For any S C [n], write it as S = {si,..., s^}. Applying Lemma [5.41 I repeatedlv. we obtain 

75 ^ TT Q{si,...,Si} ^ TT ^ > 0 

% fJlQ{su-,Si-i} ~ Y[n] 

since Yt > 0 for all T C [n] under the (ICLLI) criterion. Recall that = 1. Hence qs > 0 for all S C [n], 
which means that p is in the Shearer region. □ 

Corollary 5.43. Ifp satisfies (ICLLI) then < ya. 

Proof Lemma [5AT] yields i the result follows from Claim IS25l □ 

y[n] 

These corollaries lead to our algorithmic result under the cluster expansion criterion. The following 
theorem subsumes Theorem 1 1.3 1 and adds a statement under the assumption of slack. 


38 














































Theorem 5.44. Let Ei,..., En be events and letpi = Pr^[£'j]. Suppose that the three subroutines described 
in Section U.l.l \ exist. If p satisfies (ICLLI) then, with probability at least 1 — e“*, MaximalSetResample 
resamples no more than ^'^(1 + Vj) + 1 + i) events. 

If (ICLLI) is satisfied with a slack of e € (0,1), i.e., (1 + e)pi < 2/i/Lr+(i)> tben with probability at least 
1 — e~^, MaximalSetResample resamples no more than + Vj) + t) events. 

Proof. The first statement follows directly from Theorem 15.351 since Corollary 15.421 shows that p ^ S 
and Corollary 15.431 shows that < y,. Next assume that (ICLLI) is satisfied with e slack. We apply 
Corollary I5.42l and Corollarv l5.43l to the point p' = (1 + e)p, obtaining thatp' E S and < yj, where 

q'g denotes qs{p')- The second statement then follows directly from Corollary 15.311 □ 


5.7.1 Proof of Lemma 15.411 

Claim 5.45 (The “fundamental identity” for Y). Ya = YA-a + yaYA\r+{a)faf all a £ A. 

Proof. Eyery summand y'’ on the left-hand side either appears in YA-a if a 0 J, or can be written as ya ■ y^ 
where B = J \ r+(a), in which case it appears as a summand in ya^yi\r+(a)- 

Claim 5.46 (Log-subadditiyity of Y). Yaub < ■ Ysfar any A,BC [n]. 

Proof. It suffices to consider the case that A and B are disjoint, as replacing B with B \ A decreases the 
right-hand side and leayes the left-hand side unchanged. Eyery summand on the left-hand side can be 
written as ■ y^ with J' = J n A and J" = J D B. The product y^ ■ y^ appears as a summand on the 
right-hand side, and all other summands are non-negatiye. □ 

Proof (of Eemma 15.411) . We proceed by induction on |5|. The base case is S' = {a}. In that case we haye 
= Q{a} = 1 — Pa- On the other hand, by the two claims aboye and (ICEEI) . we haye 

^[n] ^[n]—a T ya^[n]\r+(a) — ^[n]—a T Pa^^+(a)^[n]\r+(a) — ^[n]—a T Pa^\n\- 

Therefore, < 1 — Pa which proyes the base case. 

w _ 

We proye the inductiye step by similar manipulations. By Claim 15. 131 we haye 


h _ , ^ fe\r+(a) 

w — Pa w 

QS—a QS—a 


The inductiye hypothesis applied repeatedly to the elements of S D r(a) yields 


1 -Pa 


Qs\r+{a) 

qs-a 


> 1 -Pa- 


17 


(5\r+(a))= 


17 


(S-ap 


= 1 - Pa¬ 


re 


S^ur+(a) 

Ys^+a 


By the two claims aboye and (ICEEI) . we haye 


Ys^^+a — Ysc + yaYgc\-p+(^a) > l5= + Palr-l-(a)l5=\r-l-(a) > Ifi'^ + Pals'^uF-l-ta) • 

We conclude that 

qs . 1 ^S'^ur+fa) Ysc+a - Ys<^ Ysc 

- - > 1-Pa—r;- > 1-77-= 77-• 

qs-a rs^+a rS^+a L(^S-ap 


□ 
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5,7.2 Relationship between cluster expansion and stable set sequences 

We remark that the following more general bound holds: For every J G Ind, 

X] PI = — < y^- (12) 

XGProp(J) 

The equality holds by Lemma 15.27 1 and the inequality can be derived from Lemma 15.41 1 as follows: 


Sd. 

% 


P^y{r+{j)Y ^ pj ^r+(j) 


p^yv+{j) < 


using Claim [546] for the first equality, and Claim l546l and (ICLLI) in the last two inequalities. 

A direct proof that XliGPropfj) Pi — V'’ 1*® obtained by an inductive argument similar to the proof 

of (|4|) in Section ls!2l An application of Lemma B .27 1 then establishes ([T^ . Earlier versions of this paper 
used this approach to relate the cluster expansion criterion and Shearer’s lemma. Our new approach in 
Corollary 15 .421 has the advantage that it does not require the limiting arguments used in Lemma 15.27 1 


6 Conclusions 

We have shown that the Lovasz Local Lemma can be made algorithmic in the abstract framework of re¬ 
sampling oracles. This framework captures the General LLL as well as Shearer’s Lemma in the existential 
sense, and leads to efficient algorithms for the primary examples of probability spaces and events satisfying 
lopsidependency that have been considered in the literature (as surveyed in ll26ll l. 

Our algorithmic form of the General LLL (Theorem 1 1.21) uses 0( Si=i r^) resam¬ 
pling operations, which is roughly quadratically worse than the bound of Moser-Tardos 1^ . 

Similarly, our algorithmic result under Shearer’s condition (Theorem 15. 3 5!) uses 6)( 

2^)) resampling operations, which is roughly quadratically worse than the bound of Kolipaka- 

Szegedy Ii24l . Can this quadratic loss be eliminated? 

One way to prove that result would be to prove an analog of the witness tree lemma, which is a cen¬ 
terpiece of the Moser-Tardos analysis ||30|. The witness tree lemma has other advantages, for example in 
deriving parallel and deterministic algorithms. Unfortunately, the witness tree lemma is not true in the gen¬ 
eral setting of resampling oracles (see Appendix O. It is, however, true in the variable model Ii30l as well 
as in the setting of random permutations ||2T1. Is there a variant of our framework in which the witness tree 
lemma is true, and which continues to capture the LLL in full generality? 
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A A counterexample to the witness tree lemma 


A cornerstone of the analysis of Moser and Tardos OOl is the witness tree lemma. It states (roughly) that 
for any tree of events growing backwards in time from a certain root event Ei, with the children of each 
node Ei' being neighboring events resampled before E^i, the probability that this tree is consistent with the 
execution of the algorithm is at most the product of the probabilities of all events in the tree. (We give a more 
precise statement below.) Extensions of this lemma have been crucial in the work of Kolipaka-Szegedy on 
algorithmic forms of Shearer’s Lemma fIM and work of Harris-Srinivasan on the algorithmic local lemma 
for permutations 11211 . The witness tree lemma leads to somewhat stronger quantitative bounds than the 
ones we obtain, and it has been also useful for other purposes: derandomization of ELL algorithms 1301 [131, 
parallel algorithms 1301 HU, and handling exponentially many events l20l . Therefore, it would be desirable 
to prove the witness tree lemma in our general framework of resampling oracles. 

Unfortunately, this turns out to be impossible. The main purpose of this section is to show that the 
witness tree lemma is false in the framework of resampling oracles in a strong sense. Whereas in typical 
scenarios the Moser-Tardos algorithm only requires witness trees of depth O(logn) with high probability, 
in the resampling oracle framework the stable set sequences (and an analogous notion of witness trees) can 
have nearly-linear length with constant probability. 

Before we proceed, we define a few notions necessary for the formulation of the witness tree lemma. 
Our definitions here are natural extensions of the notions from |[30l to the setting of resampling oracles. 

Definition A.l. Given a lopsided association graph G on vertex set [n], a witness tree is a finite rooted tree 
T, with each vertex v in T given a label G [n], such that the children of a vertex v receive labels from 

r+(f.). 

Definition A. 2. We say that a witness tree T with root r appears in the log of the algorithm, if event £r is 
resampled at some point and the tree is produced by the following procedure: process the resampled events 
from that point backwards, and for each resampled event j such that j G (Sy) for some v in the tree, pick 
such a vertex v of maximum depth in the tree and create a new child w of v with label £y, = j. 

The witness tree lemma, in various incarnations, states that the probability of a witness tree T appearing 
in the log of an ELL algorithm is at most OijeT show here that this can be grossly violated in 

the setting of resampling oracles. Our example actually uses the independent variable setting but resampling 
oracles different from the natural ones considered by Moser and Tardos. 

Example. Consider independent Bernoulli variables Xi,Y- , Zi and W where 1 < i < k and 1 <j<t 
The probability distribution p is uniform on the product space of these random variables. Consider the 
following events: 

• Ei = {Xi = 0 } 

• Ej = {Yi = O} 

• E' = {W = l} 

These events are mutually independent. However, let us consider a dependency graph G where Ei El for 
each 1 < i < k,l < j < t, this is a conservative choice but nevertheless a valid one for our events. (One 
could also tweak the probability space slightly so that neighboring events are actually dependent.) In any 
case, E' is an isolated vertex in the graph. 

We define resampling oracles as follows. In the following, Q describes a fresh new sample of a Bernoulli 
variable. Only the variables relevant to the respective oracle are listed as arguments. 
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• ri{Xi) = Q 

. ri{Xi,Y^,Zi) = {Zi,Q,Xi) 

. r'{W,Zu...,Zk) = {Zi,...,Zk,Q). 

Claim A.3. ri,rl, r' are valid resampling oracles for the events Ei,E^, E' and the dependency graph G. 

Proof. Vi resamples only the variable Xi relevant to event Ei and henee eannot eause any other event to 
oeeur. Conditioned on Ei = {Xi = 0}, it elearly produees the uniform distribution. 

rj switehes the variables Xi and Zi and thus ean eause Ei to oeeur (whieh is eonsistent with the depen- 

deney graph G). Conditioned on Ej = it makes y/ uniformly random and preserves a uniform 

distribution on {Xi, Zi). 

r' affeets the values of W, Zi,..., Z^ but no event depends on Zi,..., Z^, so r' eannot eause any 
event exeept E' to oeeur. Conditioned on E' = {W = 1}, sinee {Zi ,..., Zk) are distributed uniformly, it 
produees again the uniform distribution. □ 

The Moser-Tardos algorithm. First, let us eonsider the Moser-Tardos algorithm: In the most general 
form, it resamples in eaeh step an arbitrary oeeurring event. For eonereteness, let’s say that the algorithm 
always resamples the oeeurring event of minimum index (in some fixed ordering). 

Claim A.4. If the Moser-Tardos algorithm considers events in the order {Ei, Ej,E'), then at the time it gets 
to resample E', the variables Zi,..., Z^ are independent are equal to 1 with probability 1 — 1/2^+^ each. 

Proof. Let us fix i. Whenever some variable Y? is initially equal fo 0, we have fo resample El af some 
poinf. However, we only resample El if Ei does nof oeeur, whieh means fhaf Xi musf be 1 af fhaf fime. So 
fhe resampling oraele El forees Zi fo be equal fo 1. The only way Zi eould remain equal fo 0 is fhaf if is 
initially equal fo 0 and none of fhe evenfs El need fo be resampled, whieh happens wifh probabilify 1/2^. 
Therefore, when we’re done wifh Ei and El for 1 < j < I, Zi is equal fo 0 wifh probabilify 1/2^+^. This 
happens independenfly for eaeh i. □ 

Lemma A.5. The probability that the Moser-Tardos algorithm resamples E' at least k times in a row is at 
least ^(1 — 

Proof. By fhe ordering of evenfs, E' is resampled only when all ofher evenfs have been fixed. Also, re¬ 
sampling E' eannof eause any ofher evenf, so fhe algorifhm will terminate afterwards. However, as we 
argued above, when we get to resampling E', eaeh variable Zi is equal to 1 independently with probability 
1 — 1/2^+^. Considering the resampling oraele r'{W, Zi,..., Zk) = {Zi,..., Z^, Q), if W as well as all 
the variables Zi are equal to 1, it will take at least k resamplings to elear the queue and get a ehanee to avoid 
event E'. This happens with probability ^(1 — 

Let T eonsist of a path of k vertiees labeled E'. For k = 2^, we eonelude that the witness tree T appears 
with eonstant probability in the log of the Moser-Tardos algorithm, as opposed to 1 /2^ whieh would follow 
from the witness tree lemma. 

The MaximalSetResample algorithm. A slightly more involved analysis is neeessary in the ease of 
MaximalSetResample. By nature of this algorithm, we would resample E' “in parallel” with the other 
events and so the variables evolve somewhat differently. 
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Claim A,6. For each i independently, after! iterations of the MaximalSetResample algorithm, Zi = 1 with 
probability 1 — Any further updates of Zi other than those caused by resampling E' can only change 
the variable from 0 to 1. 

Proof The claim is that unless Zj = 0 and = \ initially, in the first two iterations we 

will possibly resample Ei and then one of the events El, which makes Zi equal to 1. Any further update to 
Zi occurs only when E' is resampled (which shifts the sequence (Zi,..., Z^)) or when E^ is resampled, 
which makes Zj equal to 1. □ 


Lemma A.7. The probability that MaximalSetResample resamples E' at least k times in a row is at least 

1/-I_ )_\k-2 

2^+1 1 


Proof In the first two iterations, the probability that E' is resampled twice is at least 1/4 (the values of W 
and Zi are initially uniform, and if Zi is updated, it can only increase the probability that we resample E'). 
Independently, the probability that Z 2 = ... = Z^-i = 1 after the first two iterations is (1 — 1/2^+^)^“^, by 
the preceding claim. (We are not using Zi which is possibly correlated with the probability of resampling 
E' in the second iteration, and Z^ which would be refreshed by this resampling in the second iteration.) If 
this happens, we will continue to resample E' at least k — 2 additional times, because it will take k — 2 
executions of r' before a zero can reach the variable W. □ 


Again, consider setting k = 2^. The total number of events is n = 0{k£), so i = 0(logn) and 
k = 0(n/logn). With constant probability, the witness tree T consisting of a path of k vertices labeled 
E' will appear in the log of MaximalSetResample algorithm. Thus, with constant probability, the algorithm 
will require a stable set sequence of length at least k. 
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